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Abstract. We introduce tree- width for first order formulae tp, fotw (tp). We show that 
computing fotw is fixed-parameter tractable with parameter fotw. Moreover, we show 
that on classes of formulae of bounded fotw, model checking is fixed parameter tractable, 
with parameter the length of the formula. This is done by translating a formula tp with 
fotw (99) < k into a formula of the k- variable fragment C k of first order logic. For fixed k, the 
question whether a given first order formula is equivalent to an C k formula is undecidable. 
In contrast, the classes of first order formulae with bounded fotw are fragments of first 
order logic for which the equivalence is decidable. 

Our notion of tree-width generalises tree-width of conjunctive queries to arbitrary for- 
mulae of first order logic by taking into account the quantifier interaction in a formula. 
Moreover, it is more powerful than the notion of elimination-width of quantified constraint 
formulae, defined by Chen and Dalmau (CSL 2005): for quantified constraint formulae, 
both bounded elimination-width and bounded fotw allow for model checking in polyno- 
mial time. We prove that fotw of a quantified constraint formula ip is bounded by the 
elimination-width of tp, and we exhibit a class of quantified constraint formulae with 
bounded fotw, that has unbounded elimination-width. A similar comparison holds for 
strict tree-width of non-recursive stratified datalog as defined by Flum, Frick, and Grohe 
(JACM49, 2002). 

Finally, we show that fotw has a characterization in terms of a cops and robbers game 
without monotonicity cost. 



1. Introduction 

Model checking is an important problem in complexity theory. It asks for a given formula 
tp of some class C of formulae and a structure A, whether A satisfies tp. 



MC(C) 



Input: A structure A and a formula tp £ C. 
Question: A (= tp? 
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Let C denote first order logic. It is well-known, that MC(£) is PSPACE-complete. 
Motivated by this, much research has been done on finding fragments of C having a tractable 
model checking problem. For instance, for fixed k, the problem MC(£ fe ) can be solved in 
polynomial time, where C denotes the fragment of first order formulae with at most k 
variables (see e.g. [TT]). 

The class of conjunctive queries, CQ, is an important fragment of first order logic. Many 
queries that occur in practice are conjunctive queries, and model checking of conjunctive 
queries on relational databases (i.e. relational structures) is an important and well-studied 
problem in database theory [291 El HI EQl [2Q1 [23]. It is equivalent to conjunctive query 
containment, to the constraint satisfaction problem studied in artificial intelligence and to 
the homomorphism problem for structures [71 [15]. A conjunctive query is a first order 
formula starting with a quantifier prefix using only existential quantifiers, followed by a 
conjunction of relational atoms. While MC(CQ) is NP-hard in general, several researchers 
proved independently that conjunctive queries of bounded tree-width can be evaluated in 
polynomial time [8], [18]. One way to prove this is the following. Suppose 99 is a conjunctive 
query having tree-width k. Then we can compute a tree decomposition of width k in 
linear time using Bodlaender's algorithm [6j. From the decomposition we can actually 
read off the syntax of an equivalent formula 99' E C k+1 . Finally, we use the fact that 
MC(£ fc+1 ) is solvable in polynomial time. Essentially, bounded tree- width is even necessary 
for polynomial time solvability of MC(CQ) [221 [22] , 

In this paper, we introduce a notion of tree-width for first order formulae <p, fotw^). 
Our notion generalises the notion of tree-width of conjunctive queries, and we show that 
the class Ck of all first order formulae 99 with fotw(c^) < k satisfies the following properties. 

(1) Ck has a polynomial time membership test (Corollary 14.81) . 

(2) Ck has the same expressive power as C k+1 , the fragment of first order formulae with at 
most k + 1 variables (Theorem I5.5P . 

(3) There is an algorithm that computes for given 99 € Ck an equivalent formula <p' = <p 
with ip' G C k+l (Theorem [53]). 

(4) MC(Cfc) is fixed parameter tractable with parameter the length of 99, i.e. for input 
if G Ck and A, the running time is p(||.-4||)/(M) for a polynomial p and a computable 
function / (Corollary I5.6p . 

Obviously, properties 1 and 3 imply property 4. While MC(£ ) is solvable in polynomial 
time, we do not obtain a polynomial algorithm for MC(Cfe). Nevertheless, in typical ap- 
plications one can expect the length of the formula to be small compared to the size of the 
structure (database). For a fixed formula the running time is polynomial, and moreover, 
the problem is fixed-parameter tractable (in fpt), meaning that changing 99 does not alter 
the exponent of the polynomial (see [HI IT?])- 

Note that for fixed k > it is undecidable, whether a first order formula 99 is equivalent 
to an C formula. Hence it is not surprising that our notion of fe-bounded first order tree- 
width does not capture semantic equivalence to C k (we will give more details in Section [5]) . 

Quantified constraint formulae generalise conjunctive queries by allowing arbitrary 
quantifiers in the quantifier prefix. In [9], Chen and Dalmau introduce elimination or- 
derings for quantified constraint formulae. These elimination orderings must respect the 



quantifier prefix. In this way, Chen and Dalmau obtain a notion of elimination-widt}\\, which 



spa 



Actually, the notion is called tree-width for quantified constraint formulae in [5] , But since the notion is 
defined via elimination orderings, we prefer the term elvmination-width. 
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allows for model checking of quantified constraint formulae of bounded elimination-width 
in polynomial time, using a consistency algorithm. Hereby, they answer a question posed in 
|19] positively, whether bounded tree-width methods work for formulae more general than 
conjunctive queries. Introducing a notion of tree-width for arbitrary first order formulae, we 
even go further. We show that for quantified constraint formulae 99, elimination-width of ip 
is at least as large as fotw(</?), and we exhibit a class of quantified constraint formulae with 
bounded first order tree-width and unbounded elimination-width. We show that quantified 
constraint formulae of bounded fotw allow for model checking in polynomial time. Hence 
fotw is more powerful than elimination-width. 

In |16| . Flum, Frick and Grohe introduce strict tree-widtnfl for non-recursive stratified 
datalog (nrsdJ programs. They show that model checking for NRSD programs of bounded 
strict tree-width can be done in polynomial time. Since nrsd programs have a canonical 
translation into first order formulae, our notion of tree-width can be transfered from first 
order formulae to nrsd programs. We show that if an nrsd program n has strict tree- width 
at most k, then the formula </?n obtained from n has elimination- width at most k and hence 
it satisfies fotw(</?n) < k. Again there are classes of nrsd programs with unbounded strict 
tree-width, whose corresponding first order formulae have bounded first order tree-width. 
Hence our notion of first order tree-width yields larger subclasses of C, that still allow for 
tractable model checking. 

Actually, we introduce first order tree-width as a special case of a more abstract notion 
which we term stratified tree-width. We expect that stratified tree-width will find further, 
quite different, applications. 

The rest of this paper is organised as follows. Section[2]fixes some terminology. Section[3] 
introduces the notion of stratified tree- width, the special case of first order tree-width, and 
the notion of xenerp normal form of a formula <p - a kind of opposite of prenex normal 
form. We show that fotw is invariant under transformation into xenerp normal form. In 
Section 13.41 we relate fotw to the natural notion of tree- width stratified by the alternation 
depth of a formula. In Section [4] we show how to compute stratified tree decompositions 
and, in particular, how to compute first order tree-width. In Section [5] we prove that 
bounded first order tree-width is expressively equivalent to bounded variable fragments of 
first order logic and that model checking for formulae of bounded first order tree-width is 
fixed-parameter tractable. In Section [6] we relate our notion to existing notions and give 
a game characterisation of stratified tree-width. We conclude with some open problems in 
Section 

We wish to thank the anonymous referees for many useful suggestions. 

2. Well-known definitions 

A vocabulary a = {R\, ■ ■ ■ , R n , c\, . . . , c m } is a finite set of relation symbols R4, 1 < i < n, 
and constant symbols Cj, 1 < j < m. Every Ri has an associated arity, an integer ar(i?j) > 0. 
A a-structure is a tuple A = (A, Ry, . . . , R^, cy, . . . , c^) where A is a finite set, the universe 
of A, Rf C A ar (^) for 1 < i < n, and cf G A for 1 < j < m. 

Given a u-structure A we distinguish between the cardinality \A\ of the universe A of 
A and the size \\A\\ of A, given by \\A\\ = \a\ + \A\ + Yh=i \ R f\ ' ar(-Rj). 



In [16] . the authors also introduce a notion of tree-width for first order formulae. But their notion 
disregards the quantifier interaction, and they only use it for conjunctive queries with negation. 
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We use £ to denote relational first order logic with constants, and for simplicity, we 
refer to C as first order logic. We assume that the reader is familiar with the basic notions of 
first order logic (see for instance [II]). For a formula 93 we let free(</?) denote the set of free 
variables of ip. A formula ip is a sentence, if free(y) = 0. We sometimes write 93(2:1, . . . , x n ) 
to indicate that free(y) C {x\, . . . , x n }. 

For a structure A, a formula 93(2:1, . . . ,x n ), and elements ai,...a n E A we write 
A \= tp(ai, . . . a n ) to denote that A satisfies tp if the variables x\, ■ ■ ■ ,x n are interpreted 
by 01, . . . , a n , respectively. We let 

cp(A) := {(ai, . . . ,a n ) | A \= 93(01, . . . , a n )}. 

For sentences we have ip(A) = true, if A satisfies 93, and false otherwise. If the vocabu- 
laries of 93 and A are different, we let <p(A) = 0. 

The Query Evaluation Problem for a class C of formulae is the following problem: 



Eval(C) 



Input: A structure A and a formula 93 E C. 
Problem: Compute p(A). 



Note that if 93 is a sentence, then Eval(C) and MC(C) coincide. We say that a formula 
93 E C is straight, if no variable in 93 is quantified over twice, if no free variable is also 
a quantified variable, and if each quantified variable actually occurs in some atom. All 
formulae are straight, unless stated otherwise. Moreover, we assume that all formulae are 
in negation normal form, i.e. the negation symbols only appear in front of atoms. 

We denote a graph G as a pair G = (V(G), E(G)), where the set V(G) of vertices is 
finite, and every edge e G E(G) is a two-element subset of V(G). A tree decomposition of a 
graph G = (V,E) is a pair (T,B), consisting of a rooted tree T and a family B = (Bt)t£T 
of subsets of V, the pieces of T, satisfying: 

(TD1): For each v E V there exists t G T, such that v £ Bf. We say the node t 

covers v. 
(TD2): For each edge e € E there exists t E T, such that e C B t . We say the node £ 

covers e. 
(TD3): For each v E V the set {£ E T \ v E -Bt} is connected in T. 
The width of (T, i?) is defined as w(T, 1?) := max | \Bt\ \ t E T} — 1. 
The tree-width of G is defined as 

tw(Cr) := min {w(T, B) I (T,B) is a tree decomposition ofG\. 

Fact 2.1. Every graph G of tree- width at most k has at most k ■ \V(G)\ edges. □ 

Fact 12.11 can be shown by induction on the number of vertices (see e.g. [17]). We will 
make frequent use of the following well-known fact about tree decompositions (see |llj): 

Fact 2.2. Let (T,B) be a tree decomposition of some graph G, and let C C V(G). If for 
all v,w E C, some piece of (T, B) covers both v and w, then there is some piece B t covering 
C entirely, i.e. C C B t . 

In particular, every clique in G is covered by some piece. □ 
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3. First order tree- width 



3.1. Stratified tree-width. We start with defining stratified tree-width. Then, first order 
tree-width is defined as a special case. Although it is our only application of stratified 
tree-width, stating results in greater generality allows us to focus on their essence. It is also 
quite possible, that further applications will arise in the future. 

Any rooted tree T induces a natural partial order <y on its nodes, where the smallest 
element is the root. For a tree decomposition (T,B) of a graph G and a vertex v G V(G), 
let t v G T denote the <y-minimal tree node that covers v. By (TD3), the node t v is well- 
defined. Now, let d: V{G) — > N be a function. We say that a tree decomposition (T,B) 
of G is d-stratified, if all u,v G V{G) with t u <t t v satisfy d{u) < d(v). The tree-width of 
(G, d) is defined as 

tw(G, d) := min{w(T, B) | (T,B) is a d-stratified tree decomposition ofG}. 

It will sometimes be convenient to work with an alternative characterization of stratified 
tree width: let G = (V, E) be a graph and d : V — > N. An elimination ordering of (G, d) is 
a linear ordering (v\, . . . ,v n ) of V which respects d, i.e. i < j implies d(vi) < d(vj). With 
an elimination ordering we associate a sequence of graphs as follows: 

• G n := G 

. F(Gi-i) := V(Gi) \ { Vl }, and 

• E(Gi-i) := {e G E(Gi) | Vi e} U {{u,w} \ u ^ w, {u,Vi},{vi,w} G E(Gi)} for 
1 < i < n. 

The width of the elimination ordering is maXj G r n i{deg(i>j) in Gi}. The elimination-width of 
(G, d), ew(G, d), is the minimum width of an elimination ordering of (G, d). It is well-known 
that the tree- width of a graph G equals the elimination- width of G (see [5]), and this fact 
can be generalised to our setting. 

Theorem 3.1. Let G be a graph and d : V(G) -)■ N. Then tw(G, d) = ew(G, d). 

Proof. Towards a proof of tw(G, d) > ew(G,d), let (T,B) be a d-stratified tree decomposi- 
tion for G of width k. We may assume that (T,B) is small, i.e. all nodes s,t € V(T) with 
s ^ t satisfy B s ^ Bf. Recall that for a vertex v G V(G), t v denotes the <T-minimal node of 
T with v G Bf. We now define an ordering v\, . . . , v n of V{G) such that for all 1 < i,j < n 
we have 

• i < j implies d{v{) < d(vj), 

• there is a piece Bi of (T, B) containing Vi and all the neighbours of Vi in Gi. 
In particular, vi, . . . , v n is an elimination ordering of (G, d) of width at most k. 

Claim 1. There exists a vertex v G V(G) with d(v) maximum, such that v appears in 
exactly one piece Bi of (T, B), and ^ is a leaf. 

Proof. Choose any vertex w with d(w) maximum. If w is contained in a piece B% of (5, T) 
where t is not a leaf (otherwise we are done), then choose a leaf i >t t of T. Let s be the 
parent of I. Choose v G -B^ \ B s (such a f exists since the decomposition is small). Since 
(T,B) is d-stratified and t w <t £ = t v , we have d{w) < d(v), and hence by maximality 
d(w) = d(v), proving the claim. ■ 

Let v n := v. Then we replace G by G n _i, we restrict d and (T,B) to G n _i and we 
proceed by induction. 
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Towards tw(G, d) < ew(G, d), let v±, . . . , v n be an ordering of V(G) of width at most 
k and let Gi, . . . , G n = G be the associated sequence of graphs. For i = 1, . . . , n we define 
tree decompositions for the Gi that respect d and have width at most k. For i = 1 we take 
the trivial decomposition. Given a tree decomposition of Gj_i, we choose a piece containing 
all the neighbours of vt in Gi (such a piece exists, because the neighbours induce a clique 
in Gi-i), and we attach to it a new piece containing Vi and all neighbours of Vi in Gi. Let 
(T, B) be the tree decomposition obtained for G = G n . Obviously, (T, B) has width at most 
k. Moreover, (T,B) is (i-stratified: let Vi,Vj S V(G). Iit Vi <t t Vj , then, by construction, we 
have i < j. Since vi, ■ ■ ■ , v n is an elimination ordering of (G, d), this implies d(vi) < d(vj). □ 

3.2. First order tree-width. For a formula <p, the formula graph is the undirected graph 
Gp, with vertices var (</?), and edges {x,y} whenever x and y are free variables, or when x 
and y occur together in some atom of (p. (If tp is not straight, then we obtain the formula 
graph of <p by first making it straight.) Note that the formula graph depends on the syntax 
of the formula. Logically equivalent formulae may have different formula graphs. 

We now introduce a partial order ^ on the variables of a formula ip, from which we 
then obtain the essential alternation depth, ead ¥ ,(x), of a variable x G var (ip). Given a 
tree decomposition of G^ of width k — 1 that respects ead^, we show in Section [5j how to 
transform the formula ip bottom up along the decomposition into an equivalent £ fc -formula. 
In this transformation, we want to 'reuse' as many variables as possible, so, intuitively, the 
'worst case' is that ip is in prenex normal form. Hence we want to 'undo' prenex normal 
form, pushing quantifiers as far as possible away from the root in the syntax tree. Of course, 
we have to make sure that we obtain an equivalent formula. Intuitively, -< v gives us a partial 
order of quantifications that we have to respect while undoing prenex normal form. 

For a bound variable x, let Q x € {3, V} be the type of quantifier used to quantify x in 
(p. Then the scope of x is the unique subformula ip of ip such that Q x xip is a subformula of 
ip. For bound variables x, y of ip, we write x < v y to denote that x = y or y is quantified in 
the scope of x. For a set X of variables, we use y>\x] to denote the minimal (with respect 
to subformulaship) subformula of ip which contains all atoms using variables from X. 

Definition 3.2. Let < be a binary relation on the variables of some formula ip. Then two 
variables x and y are entangled with respect to < and ip, if x occurs in V[j/<] an d y occurs 
in (p[x<] (as usual, we use x < to denote {x' \ x < x'}). 

Definition 3.3. Let p be a straight formula. Then -<^ is the minimal (with respect to C) 
binary relation on var (ip), such that the following hold. 

(1) ^ is reflexive. 

(2) -<<£ is transitive. 

(3) If x < v y, Q x 7^ Q y and there is a sequence x = zq, . . . , z n = y of bound variables such 
that for all < i < n we have that z%, Zi+i are entangled with respect to ^ and ip and 
that x -< v Zi or y < v Zi, then x ^ y (Alternation). 



In order to see that ^ is well-defined, observe that Definition 13.31 is in fact an inductive 
definition: all three conditions can be restated as closure of ^ under some operator on 
binary relations, and all three operators are monotone with respect to C. The least obvious 
case is the one of the operator underlying Alternation. To establish monotonicity in this 
case, assume that <j and <' are binary relations on v&i(ip) and that (<j) C (<!'). We have to 
show that whenever two variables x,y satisfy Alternation with respect to <, then they also 
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do with respect to <'. For any variable z we have (z <) C (z <'), so P[ z <\] is a subformula of 
V[2<']- Thus, entanglement of some variables with respect to < and ip implies entanglement 
with respect to <' and ip. Hence any witness for Alternation with respect to < is also one 
with respect to <'. 

At many places, we will use proof by induction on the inductive definition of < Lp . 
Therefore, we explicate how the inductive principle works in this case. 

Lemma 3.4. Let ip be a formula and P a property of pairs of variables from ip. If 

(1) P(x,x) holds for all x G var(<^) ; 

(2) x <^y,y <<p z, P(x,y) and P(y,z) imply P(x,z), and 

(3) if x < v y, Q x ^ Q y , (<) C (r^) such that P(x',y') holds for all x' < y' , and for 
some sequence x = zq, . . . , z n = y and all < i < n we have z% S (x <) U (y <) and 
entanglement of Z{ and z%+\ with respect to < and p, then P(x,y), 

then P(x, y) holds for all x, y such that x <^ y. □ 

Remark 3.5. 

(1) The relation <^ is a subrelation of < v : (^ v ) C (< ¥> ), 

(2) the relation -<^ is a partial order, and 

(3) x -<<p y holds whenever x and y are entangled, Q x ^ Q y , and x < v y. 

Proof. 1 follows since < v satisfies all closure conditions. 

2: x ^tp y is reflexive and transitive by definition, and it inherits anti-symmetry from (<«) 

byl. 

3: this follows by letting n = 1 in Alternation. □ 

Example 3.6. Let p := 3x\/y3z(Pxy A *iu{Ryu V Pzu)}. Then x -<^ y and z ^ u by 
Remark 13.51 3, y <^ z by Alternation (witnessed by the sequence y,u,z), and x ^ v z, 
y ^ v u, and x -<^ u by Transitivity In this example, all entanglements are due to the two 
variables in question occuring in the same atom. 



We use <p x as a shorthand for (p\ x ^, i, and we say that x and y are entangled in ip, if 
they are entangled with respect to ^ and ip. Note that this is the case if and only if x 
occurs in <p y and y occurs in <p x . Observe further that <p x is a subformula of the scope of x. 
The idea behind entanglement is to capture interaction between variables. 

Example 3.7. Let ip := Vx\/x'3y(((Py A Px) V Px) A ((Py A Px') V Px')). Then i^}] 
already is the whole quantifier free part of ip, hence so is <p y . Further, p x contains <P\{ X \\ = 
{Py A Px) V Px. Thus x occurs in <p y and y occurs in (p x , so x and y are entangled. It 
follows that x ^u, y, so <p x contains p y . As this is the whole quantifier free part, we have 
Px = <py In a similar way we obtain <p x > = <p y . Consequently, x and x' are entangled as 
well. Intuitively, x and x' interact through y. 

Example 3.8. Let p := Vx3yVz(i?zy V (Px A Py)). Then y and z are entangled, be- 
cause they occur in the same atom, x, however, is not entangled with any other vari- 
able, because *P[{x}] = P x does not contain any variable besides x. The same holds for 
ip := 3y(yzRzy V (VxPx A Py)), which illustrates that x does not interact at all. Thus, 
(^<p) = {(.y,z),(x,x),(y,y),(z,z)}. 



I. ADLER AND M. WEYER 



Example 3.9. For n > let 

ip n := 3xi • • • 3x n My3z j /\ RxiZ A Py 

, l<i<n 

and 

ip n := 3xi . . . 3x n Vy3z I /\ (RxiZ A Py) 

\l<i<n 

In y? n , the only entanglements are between the x% and z. Consequently, < tpn is the equality 
relation on var(<yj n ). On the other hand, both (^\) n )y and (tp n ) z coincide with the quantifier 
free part of ip n , so y and z are entangled in i/j n . It follows that y ~^^ n z. Nevertheless, as y 
does not occur in (VVi)x; = Rxtz, y is not entangled with Xi in ip n . 

Definition 3.10. Let (p be a first order formula and x £ var(c^). The essential alternation 
depth of x in 93, denoted by ead ¥ ,(x), is the maximum over all rf^-paths P ending in x of the 
number of quantifier changes in P, adding +1 in case the first variable on P is existentially 
quantified and +2 if it is universally quantified. If x is a free variable, we let ead^(x) = 0. 

The +1 respectively +2 in the definition makes sure that ead^(x) is odd if and only if 
Q x = 3. 

Example 3.11. The formula from Example 13.61 satisfies ead^(x) = 1, ead ip (y) = 2, 
ead ip (z) = 3, and ead^(-u) = 4. 

The formula from Example 13.71 satisfies ead^(x) = ead^(x') = 2 and ead !fi (y) = 3. 

The formulae from Example 13.81 satisfy ead v ,(x) = ead (y9 (z) = 2, ead (y0 (y) = 1, and 
ead^, = ead<p. 

For the formulae from Example 13.91 we have ead !fin (xi) = ead^ n {x{) = ead^ n (z) = 1, 
ead Vn (y) = ead^ n (y) = 2, and ead^Jz) = 3. 

Definition 3.12. If we replace -<^ by <^ in Definition 13.101 we obtain the (usual) altern- 
ation depth of x in </?, which we denote by ad^. 

Remark 3.13. Every formula cp satisfies ead^ < ad^. 

Example 3.14. For the formuale from Examples 13.61 and 13.71 we have ad^, = ead^,. 

The formulae from Example 13.81 satisfy ad ¥ ,(x) = 2, ad v (y) = 3, ad v (z) = 4, and 
ad,/, = ead^,. 

For the formulae from Example 13.91 we have ad^ = ad^, n = ead^, n . 

Definition 3.15 (First order tree- width). For a formula <p we define the first order tree- 
width of if by fotw(9?) := tw(G ¥ ,,ead ¥ ,). 

Accordingly we say that (T, B) is a tree decomposition for 93, if (T, B) is an ead^-stratified 
tree decomposition for G v . 

Note that for a formula </?, the variables free(ip), as well as the variables of any atom or 
literal in (p induce cliques in G v . In particular, since ead^(x) = for any free variable x, 
by Fact 12.21 it is no restriction to require that the free variables be covered in the root of a 
tree decomposition. 

In general, the difference between iotw((p) and tw(G v ) can be unbounded: 

Proposition 3.16. For every n > there is a formula (p n with fotw(</? n ) = n and 
tw(GW) = l. 
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Proof. Let 

ip n = 3xi... 3x n My /\ Exm ■ 

l<i<n 

Then G Vn is the re-star with center y, and we have ead Vn (xi) = 1 for all 1 < i < n, and 
ead v , n (y) = 2. It is easy to see that any ead ¥ , n -stratified tree decomposition (T,B) of G Vn 
has a piece {x±,X2, • • • ,x n ,y}, namely B ty . On the other hand, such a tree decomposition 
needs no other pieces. Hence fotw(c/? n ) = n. Since G (fin is a tree we have tw(G ((0n ) = 1. □ 

Lemma 3.17. Given a formula ip, we can compute -<^ and ead^ in polynomial time. 

Proof. In order to compute ;^«, consider the three closure operators implicit in its definition. 
As -< v is a binary relation on var(c/?), a quadratic number of applications of the closure 
operators suffices to produce <^. Hence it remains to show that each closure operator is 
computable in polynomial time. This is immediate for Reflexivity and Transitivity. For 
Alternation, let (<) C var((£>) 2 be the current approximation of ^ v . First, we compute the 
formulae <P[ Z <\] for all variables z, and from these the entanglement relation. Then, checking 
whether some pair (x, y) needs to be added to R because of Alternation basically amounts 
to reachability in the entanglement graph restricted to (x <) U (y <). 

It is clear that ead^ can be computed from <p and -<^ in polynomial time. □ 

3.3. Xenerp normal form. Prenex normal form aims to make the scopes of quantifiers 
as large as possible. Working in the opposite direction, we obtain what we term xenerp 
normal form. 

Definition 3.18. A subformula x °f a formula ip is in xenerp normal form with respect to 
ip, if for all variables x quantified in \ t ne following holds: <p x is immediately preceeded by 
a quantifier sequence which contains Q x x. 

A formula (p is in xenerp normal form, if it is in xenerp normal form with respect to 
itself. 

Example 3.19. Recall the formulae ip and ijj from Example 13.81 We have already seen 
that ead^ = ead^,. Furthermore, ip and t/j are equivalent and ip is in xenerp normal form, 
whereas ip is not in xenerp normal form. 

The following lemma presents equivalent transformations of formulae, such that neither 
the formula graph, nor the essential alternation depth, nor the corresponding tree decom- 
positions change. 

Lemma 3.20. Let ip and ip be formulae satisfying either 1, 2 or 3. 

(1) There are formulae Xl>--->Xn> a positive Boolean combination 9 of n arguments, and 
a variable x which does not occur in X2i ■ ■ ■ ,Xn, such that ip is obtained from ip by 
replacing a subformula 9(Q x xxi,X2, ■ ■ ■ ,Xn) by Q x x9(xi,X2, ■ ■ ■ ,Xn)- 

(2) There are a formula X; an d variables x, y with Q x = Q y such that ip is obtained from ip 
by replacing a subformula Q x xQ y yx by Q y yQ x xx- 

(3) There are a formula \, and variables x,y with (p x a proper subformula of <p y , such that 
ip is obtained from ip by replacing a subformula Q x xQ y yx by Q y yQ x xx, where Q y yx is 
xenerp with respect to ip. 
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Then tp = ip, ead^, = ead,/,, and G v = G^. Consequently, tree decompositions for p coincide 
with tree decompositions for ip and in particular fotw(c^) = fotw(?/;). 

Implicitly, we assume in these cases that Q x and Q y are the same with respect to the 
formula p and with respect to the formula ip. 

Proof. In all three cases the formulae differ only in the position of quantifiers, so G v = G^ 
is immediate. Also, for all X C var(<^) = var (?/>), the formulae P\x] an d V'fxi are essentially 
equal: the only potential difference between them is the same shift of quantifiers which 
led from p to ip. In particular, the same variables occur in p<x] as i n fax]- Hence, all 
differences between < v and <^ (and thus between ead^ and ead,/,) must ultimately stem 
from differences between < v and <^. 

For the first replacement, the equivalence p = ip is well-known. For the other parts of 
the statement, the only change between < v and <^ is, that for all variables y quantified in 
Xi for some 2 < i < n, we have x ^ y but x <w, y. We show that this change has no impact 
on ■<. Clearly {■< {p ) C (^) because of (< v ) C (<,/,). For the converse we use induction on 
derivations. More precisely, we show by induction on the inductive definition of ^i/> that for 
all x 1 <^ y' we have x' <^ y' . That is the inductive property P(x',y') as in Lemma l3~il is 
x' ^np y' ■ The first two inductive rules are trivial, because we know that -< v is reflexive and 
transitive. Hence we can concentrate on Alternation. So let x' <v, y' , (<) C (^) (~l (^ v ), 
and x' = zq, . . . ,z n = y' be given such that Q x ^ Q y , and for all < i < n we have 
Zi € (x' <) n (y' <) and entanglement of Zi,Z{+i with respect to < and ip. By our above 
observation on the similarity of p\x\ an d ip\x\ f° r an y set X of bound variables, it follows 
that Zi and Zi+i are also entangled with respect to < and p, and thus with respect to < v 
and p. Consequently, either x' -<^ y' holds (and we are done), or x' ^ y' . Then, together 
with x' <i£ y' it follows that x' = x and y' is quantified in Xj f° r some 2 < j < n. As re <^ Z{ 
or y' <(p z i for all < i < n, all Zi are quantified either in Q x %Xi or i n Xj- As the former is 
true for zq = x and the latter for z n = y', there is some < i < n such that Z{ is quantified 
in Q x xxi a nd Zj+i in Xj- Then yr 2i <] is a subformula of c/9 2i , which in turn is a subformula 
of the scope of Z{ and thus of Q x xxi- Similarly, the scope of Zi+i is a subformula of Xj- As 
all occurences of Zi+\ are in its scope and Xj is disjoint from Q x xxi, this contradicts the 
fact that Zi + \ occurs in P[ Zi <i\. 

In the second replacement, the equivalence p = ip also is well-known. So let us show 
(z^) = (^ip)- As the replacement is symmetric, it suffices to show (^) C (^), which we 
do by induction. Again, the cases of Reflexivity and Transitivity are clear. For Alternation, 
let < and x' = zo, . . . , z n = y' be given as above. Again, we obtain that Alternation also 
yields x' -< v y' unless x' ^ v y'. But then x' = x and y' = y contradicting Q x = Q y . 

In the third replacement, if Q x = Q y then this case is subsumed by the second re- 
placement so we may assume Q x ^ Q y . Let us start by showing p = ip. We will even 
show Q x xQ y yx = QyyQx^X- As Q y yx is in xenerp normal form we have, for variables v, w 
quantified in Q y yx, that p w is a subformula of p v whenever v < v w. In this case w occurs 
in p v (as the formula is straight, w does occur somewhere, and it can only occur in p w 
which is a subformula of p v ). So for entanglement of v and w it suffices to show that v 
occurs in p w . Let V := {v £ v&r(p>) | y < v v and p x is a subformula of p v }. Observe, that 
V is linearly ordered by < v . 

Claim 1. There is a subformula 9 of x which is a superformula of p x such that no variable 
from V occurs free in 6. 
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Proof. For contradiction, assume the opposite. Then we inductively define a sequence 
vo,vi,... of variables. Take vq to be x. For all i > 1 we will have V{ € V. For defin- 
ing Vi + i from Vi, let 9{ be ip Vi , together with the sequence of quantifications of variables 
from V which immediately preceeds <p Vi . Hence 9i is a subformula of Q y yx- If @i = QyVXi 
then we terminate the sequence. Otherwise, by our assumption, some variable from V 
occurs free in 9{. Then let t>j+i be such a variable. As Q y yx is xenerp, B% is a proper 
subformula of <p Vi+1 . This implies that V{ occurs in f Vi+1 and the converse holds by choice 
of i>i+i, because 9i essentially coincides with ip Vi . Hence Vi and v,i + \ are entangled whenever 
both are defined. If furthermore i ^ 0, then we also have vi + \ < v Vi and vi + \ ^ Vi, because 
QyDX is xenerp and ip Vi is a proper subformula of <p Vi+1 ■ Hence, as V is finite, the sequence 
vo, . . . must terminate, say with v n . As ip x is a proper subformula of ip y , we have 9q 7^ QyVX: 
so n > 0. Now let < m < n be minimal such that Q Vm = Q y . The case that there are 
no such m will be handled later. As Q Vo = Q x ^ Q y , we have m > so ip x is a proper 
subformula of <£>„ m . We obtain v m <^ Vi for all 1 < % < m using a backwards induction as 
follows. The base case is Reflexivity. For the inductive step we have a chain v m ,...,Vi of 
entanglements and by the inductive hypothesis we have v m ^ Vj for all intermediate j > i. 
Further v m < Vi and Q Vm = Q y ^ Q Vi , so we obtain v m < v Vi using Alternation. Then, in 
total we have a chain x = vo, v±, . . . , v m of entanglements such that x -< v vq by Reflexivity, 
v m ^tp Vi for all 1 < i < m and Q x 7^ Q y = Qv m , so Alternation implies x -< v v m . But 
then (v m ^„) C (x ^^), contradicting that <p x is a proper subformula of f Vm - Now for the 
case that Q Vi = Q x for all i such that Vi is defined. Then 9 n = Q y yx, so ip Vn = ip y and 
thus v n and y are entangled. This time we have a chain y,v n , ... ,vi,x of entanglements 
with the same properties as the sequence v m , . . . ,v±,x above: it is descending with respect 
to <tp except for the last step, the first variable has the same quantifier as y and all other 
variables have the same quantifier as x. Thus similar to the above we obtain x ^ v y, this 
time in contradiction to <p x being a proper subformula of <p y . This proves the claim. ■ 

So let 9 be a subformula of x an d a superformula of ip x such that no variable from V 
occurs free in 9. Hence all free variables of 9 except x are also free variables of Q x xQ y yx- 
Without loss of generality we may assume that Q x = 3 and Q y = V. It is well-known that 
3x\/yx implies VyBxx- For the converse, let I be an interpretation for BxVyx such that 
Z \= \/y3xx- We need to show that X \= BxVyx- For a value a from the universe of the 
interpretation I, we have that I- is an interpretation for Vyx- As all free variables of 9 
are also free in Vyx> we have that I- also is an interpretation for 9. Now let a$ be some 
value such that I— \= 9. If no such ao exists, let instead ao be arbitrary. Now for all a we 
have that X— \= 9 implies I— \= 9. Now let b be an arbitrary value. As X \= VyBxx, we 
have that X\ := X-^ \= x for some value a! . Let us examine the impact that replacing X\ 
by X2 := X-^- has on the formula x- Recall that (p x is a subformula of 9, so in particular 
x does not occur free in x except in 9. Hence the only possible change comes from 9. If 
X\ |= 9, then also I2 \= 9 and nothing changes, that is X2 |= x- The same argument holds 
if X±,X2 \/= 9. Otherwise X\ ^= 9 and I2 |= 0. In this case recall, that all formulae are 
in negation normal form, so % is positive in 9. Thus X\ \= x again implies I2 |= x- So 
XMA = ik ssl |= x f or al i values 6, which implies X^f \= \/ yx and then X \= BxVyx- 

Next, let us compare ^ v with <^. The only difference between <^ and <^ is, that 
x < v y i^p x while x ^ y <y, x. To show (^ v ) = (^i/>), we show inclusion in both 
directions by induction. Let us start with (^w) C (^•0). As in the proofs for the other 
replacements, the only interesting case is where x' -< v y' but x' ^ y' . This implies x' = x 
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and y' = y. Then x <^ y in contradiction to ip x being a proper subformula of ip y . For the 
other inclusion we need to recall a little more from the above cases: we obtain a sequence 
of entanglements (which are such both in ip and in <p) x' = zo, ■ ■ ■ , z n = y' such that for all 
< i < n we have x' <^ z$ and x' <^ z.- L or y' <^ Zi and y' < v z.- L . Also we have x' <^ y' 
and we are done unless x' ^ v y', hence x' = y and y' = x. But then we have x < v y and 
a chain x = v n , . . . ,vq = y of entanglements. As further x -<^ v n by Reflexivity, we obtain 
x ~^tp y which again gives a contradiction. This concludes the proof of Lemma 13.201 □ 

Observe, that the first class of replacements in the previous lemma are exactly what is 
used in turning a formula into prenex normal form. For xenerp normal form, we need the 
first and third class. 

Corollary 3.21. Let (p be a formula and let ip be a prenex normal form (obtained in the 
usual way) of if. Then fotw(ip) = fotw(ip). □ 

Lemma 3.22. From a formula ip we can compute in polynomial time a formula ip in 
xenerp normal form, such that <p = ip, G v = G^, ead^ = ead^, and consequently, tree 
decompositions for ip coincide with those for ip. 

Proof. We work by applying replacements from Lemma 13.201 in one direction or the other. 
More precisely, we apply replacements of the first kind backwards whenever possible. When- 
ever no such replacement is applicable, then each quantifier sequence ends with some quanti- 
fier Q x x such that the scope of x is an atom, a negated atom, or a conjunction or disjunction 
of two subformulae, both of which contain x. In all three cases, <p x contains the scope of x. 
As the reversed inclusion always holds, we have that Q x x immediately precedes (p x . 

If the formula ip' at hand is xenerp, we are done. Otherwise let x be ^/-maximal 
such that the quantifier sequence containing Q x x does not precede <p x . By the above, this 
sequence does not end with Q x x, so the scope of x has the form Q y yx- By maximality of x, 
Q y yX is xenerp with respect to ip' . In particular, the quantifier sequence containing Q x x is 
followed by ip y . By the choice of x we conclude that tp x ^ ip y , so tp x is a proper subformula 
of ip y . Hence a replacement of the third kind is applicable (in the forward direction). 

It remains to show, that repeatedly applying these replacements terminates. As a first 
semi- invariant, consider the sum, taken over all variables x, of the distance that Q x x has 
from the root in the syntax tree of the formula. (Backwards) replacements of the first kind 
increase this semi-invariant while replacements of the third kind do not change it at all. 
On the other hand, replacements of the third kind decrease the number of variable pairs 
(x,y), such that x <<v y and p x is a proper subformula of <p y . As both semi-invariants 
are polynomially bounded in \<p\, and each replacement (including the test for applicability) 
requires only polynomial time (recall Lemma l3.17p . the procedure runs in polynomial time. 

□ 

3.4. Comparing ead with ad. Let (p be a first order formula. As <^ is coarser than <„, 
it is immediate that ead^, < ad^. However, this does not directly imply that fotw((/j) < 
tw(G v ,,ad^ ) ) also holds. Variables which are incomparable by < v (and thus by < v ) are 
given an order by ead^, and by ad^, but not neccessarily the same one. Consequently, 
not every ad^-stratified tree decomposition is also ead^-stratified. Nevertheless fotw^) < 
tw(G v ,,ad (/5 ) does hold, and it is the purpose of this subsection to show this fact. 

At the core of our proof there is a double induction which we cast into two auxiliary 
lemmata. An entanglement chain (in some formula ip) is, of course, a sequence Vq,... ,v n 
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of variables such that for all < i < n the variables v% and Vj+i are entangled in ip. The 
entanglement chain is hanging, if vo,v n <« Vi for all < % < n. It is crossing, if Q VQ ^ Q Vn . 
It is nice, if min< (^o^n) z^ Vi for all < i < n. (We will only talk about nice chains in 
contexts where the existence of mm.< v (vQ,v n ) is guaranteed a priori. As a side note it is 
not hard to see that this minimum exists for all hanging chains.) 

Lemma 3.23. Let ip be a formula such that < v is a total order (for example p is in prenex 
normal form). Let x = V- m ,...,V-i,y = vq,v\, . . . , v n = z be an entanglement chain 
without repetitions such that the following hold: 

(1) Qx + Qy = Qz- 

(2) x <tp Vi for all —m < i < n. 

(3) The subchain y = vq,v\, . . . ,v n = z is hanging. 

(4) x <^ Vi for all —m < i < 0, i.e. the subchain x = v_ m , . . . ,v-\,vq = y is nice. 

(5) For all < k < £ < n, if the subchain v^, . . . , vi is hanging and crossing, then it is also 
nice. 

Then the chain is nice. 

Proof. Otherwise assume a counterexample with n minimal. Obviously, n > 0. If n = 1, 
then we can use the chain to derive either x ^ z or z ^ x by Alternation. In the first 
case we are done, in the second case we use Transitivity with x to derive z ^ v i for all 
— m < i < 0. 

Hence in the following we may assume n > 1. Let < k < n be such, that v^ <<p Vi for 
all < i < n, that is the subchains vq,... ,Vk and v^, ■ ■ ■ , v n are hanging. If Q Vk = Q x , then 
the last premise gives y ^ Vi for all < i < k and z ^ Vi for all k < i < n. The former, 
together with x -<^ vq = y, gives x ^ v Vi for all < i < k. Hence, for all —m < i < n we 
have x ^ v Vi or z -<^ Vi. Thus the chain again witnesses x ^ zoiz ^ x, so transitivity 
also gives x <^ Vi for all k < i < n or z ^ Vi for all — m < i < k. 

So we are left with the case Q Vk = Q y = Q z - Then the subchain x = V- m , . . . ,v^ 
satisfies all conditions of this lemma. Minimality of n implies that x -<^ Vi for all —m < 
i < k. Now we set m! = m + k, n' = n — k, and v[ = Vi + k for all —m' < i < n'. 
The thus shifted entanglement chain again satisfies all conditions, so minimality of the 
counterexample implies that the shifted chain is nice. Then so is the original chain. O 

Lemma 3.24. Let ip be a formula such that < v is a total order. Then every hanging and 
crossing entanglement chain is nice. 

Proof. Otherwise assume a counterexample vq, . . . ,v n with minimal n. The fact that the 
chain is crossing prohibits n = 0. If n = 1, the claim follows from Alternation. Hence we 
assume n > 1. Let < k < n be such, that v^ < v Vi for all < i < n. We have Q Vk = Q x or 
Qv k = Qy, without loss of generality the latter. Then by minimality of the counterexample 
the shifted entanglement chain with v^ in the middle satisfies all conditions of Lemma l3.231 
Hence the claim follows from that lemma. □ 

Theorem 3.25. For all <p € C we have fotw((£>) < tw(G ¥ ,,ad ¥ ,). 

Proof. Without loss of generality, we may assume that p is in prenex normal form: otherwise 
let ip' be a prenex normal form of p such that ad^ = ad^/. Such a p' can be obtained by 
moving quantifiers to the left in p (which is the normal procedure for making a formula 
prenex) while always chosing a variable with minimal ad^. Among the prenex normal 
forms of p, this <p' is a sensible prenex normal form anyway, in that it does not introduce 
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unneccessary alternation. Observe that dp = Gw. Now by Lemma [3.201 we have fotw^) = 
fotw(<//) while the choice of ip 1 implies tw(G ip ,ad v ) = tw(G (/p ',ad ¥ ,'). 

By virtue of Theorem l3.1l it suffices to prove ew(G (/3 , ead^) < ew(G (/3 , ad v ). We show that 
for all formulae ip in prenex normal form, all k, and all elimination orderings of {G v , ad v ) of 
width k, there is an elimination ordering of (G^,, ead^) of width k. Let ip = Q Xl xi . . . Q Xn x n 
where 9 is quantifier free. 

For any ordering yi, . . . ,y n of {x\, . . . ,x n }, a ip- fault is a pair 1 < i < i' < n such that 
ead^d/j) > ead v (?/j/). Let us assume that <p and yi,...,y n form a counterexample with 
a minimal number of 92-faults, that is yi,...,y n is an elimination ordering of (G^ad^,), 
(Gy,, ead^) has no elimination ordering of equal width, and the number of ^-faults is minimal 
for all choices of ip and y\, . . . ,y n . If this number of 99- faults is 0, then y\, . . . , y n also is 
an elimination ordering of (G^ead^,), contradicting the counterexample property. Hence 
there is some 93- fault. 

Let k be the width of y±, . . . , y n with respect to G v . Let ip := Q yi y\ ■ ■ ■ Qy n y n 8- As 
2/1, . . . , y n is obtained from x\, . . . , x n only by rearranging variables within quantifier blocks, 
we can obtain ip from ip by a sequence of replacements as in Part 2 of Lemma 13.201 Hence 
that Lemma implies G v = G^ and ead^ = ead^,. Also, ad v = ad,/,, so in particular yi,...,y n 
is an elimination ordering of {ip, ad,/,) of width k. By Definition of ip, we have yi <,/, y^ if 
and only if i < i'. Furthermore, <p- faults and ip- faults of y\, . . . , y n coincide. 

As there is a ip- fault, there also is one which concerns two subsequent variables, that 
is for some i we have ead^(yj) > ead^,(yj + i). Let z\,...,z n be the sequence y\, . . . , y%-\, 
Ui+i, Vi, Vi+2,--- ,Vn, i-e. yi and y i+1 change places. Further, let \ ■= Qz^i . . .Q Zn z n 6. 
Obviously, z%, . . ■ , z n is an elimination ordering of (G x , ad x ). We claim that yi 7^ yi + \ and 
that {yi, yi+i} is no edge of G^j+i, where this graph is as in the definition of elimination 
width with respect to the sequence y\,...,y n . These claims are proved later, let us first 
show how to make use of them. The fact G x = G^ and the non-edge between yi and y.; + i in 
G^j+i imply that the width of z\, ■ ■ ■ , z n with respect to \ and ip is also k. From yi ^ Vi+\ 
it is easy to see (and somewhat implicit in the proof of Lemma I3.20P that ead x = ead^, . In 
particular, ■(/'-faults and x-faults coincide. By construction, z\,...,z n has one such fault less 
than yi, . . . , y n , so by minimality of the counterexample there is some elimination ordering 
of (G x ,ead x ) of width k. As G x = G^ = G v and ead x = ead^, = ead^,, it is also one of ip 
of equal width, in contradiction to ip, k being a counterexample. 

Now for the claims. First, assume for contradiction that y-i <^ yi+%. As yi ^ y-i+i, this 
is not due to Reflexivity. As yi + \ is the <^,-successor of yi and (^^,) C (<,/,), there can be 
no intermediate variable, hence yi <^ yi + \ is due to Alternation. But then Q yi ^ Q Vi+1 , so 
ead^(yi + i) > ead^(y) + 1 in contradiction to i, i + 1 being a ip-£au\t. 

For the second claim assume, again for contradiction, that y, forms an edge with yi+\ 
in G^j+i, i.e. in Gw, there is a path from yi to j/i+i such that all internal vertices of that 
path are of the form yj with j > i + 1. We view the path as an entanglement chain 
yi = Vo,...,v n = yi+i- As the elimination ordering is <</,, we have yi,yi + \ <^ Vj for 
all < j < n, that is the chain is hanging. If Q Vi 7^ Qy i+1 , then the chain is nice by 
Lemma 13.241 In particular, t/j <^ t/i+i, contradicting the previous claim. Now for the case 
Qyi = Qyi+i- Then ead^(yj + i) < ead,/,(y,) implies that there is some w £ vai(ip) such that 
ead,/,(tt>) = ead,/,(y,) — 1 and w <^ yi. It follows that Q w 7^ Q Vi . The fact w ^ yi is 
not due to Reflexivity because Q w 7^ Q Vi . It is also not due to Transitivity: otherwise let 
u be the intermediate variable. Then Q u = Q yi or Q u = Q w , without loss of generality 
the latter. Then w ^ u is due to Transitivity and further unfolding Transitivity until 
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Figure 1: Example l3.261 the formula graph of <p n and an ead^ n -stratified tree decomposition 
of width 1 of tp n . 

Alternation is applicable eventually yields some v! such that w -^w, u' ^w, u -<^ m and 
Qw 7^ Qu> 7^ Qm 7^ Q^i- Then ead^(w) + 1 = ead^fyi) > ead^(w) + 3, a contradiction. 
Hence w -^w, yt is due to Alternation, so there is some corresponding entanglement chain. 
Prepending it to the one we have yields a chain w = V- m , . . . , V-±,vq = yi,v%,. . . ,v n = Vi+i- 
Furthermore this chain satisfies the first three conditions of Lemma f3.23l (with respect to tp). 
By virtue of Lemma 13.241 the other two conditions are implied. Hence Lemma 13.231 yields 
w <^ y i+ i, so ead^(y i+ i) > ead^(w) + l = ead^(yj), contradicting ead^(y i+ i) < ead^(yj). D 

The following example shows that the difference in the opposite direction can be un- 
bounded. 

Example 3.26. For n > 0, recall the formula (p n from Example 13.91 We have seen that 
ead^^j) = ad ¥ , n (xj) = 1 = ead lfin (z) for i < n, and ead </ , n (y) = ad^y) = 2, whereas 
ad lfin (z) = 3. Figure [Q shows the formula graph of (p n together with an ead^ -stratified tree 
decomposition of width 1 of (p n . We actually have fotw(<£> n ) = 1. On the other hand, it 
is easy to see that every ad ¥ , n -stratified tree decomposition of ip n has a piece var(c/? n ) or 
var(t/? n ) \ {y}, and hence tw(G ¥ , n , ad^) = n. 



4. Computing stratified tree decompositions 

In this section, we show that computing stratified tree decompositions of optimal width 
is fixed-parameter tractable, where the parameter is the stratified tree-width. In fact, the 
running time is essentially linear for bounded stratified tree-width. In the next section, we 
will use the algorithm developed here as a first step for formula evaluation. 

Definition 4.1. Let G = (V,E) be a graph and d : V -> N. We say that (G, d) is 
normalized, if for n := \V\ we have V = {1, . . . , n} and d(v) < n for all v € V. 

We obtain a linear time algorithm only for normalized inputs. In the following, let 
(G, d) with G = (V, E) be normalized. We set d max := max^gy d{v) and for any integer i 
we let Xi := {v € V \ d(v) = i). Let Nq(C) denote the set of neighbours of C in V \ C. 

Definition 4.2. For < i < d max , let G^ 1 ' be the graph with vertex set V, such that two 
vertices x, y form an edge in G^', if in G there is a path from x to y with all internal vertices 

in Uj>ma,x(i,d(x),d(y)) X j- 
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In particular, every edge of G is present in all G^ 1 ' by virtue of a path without any 
internal vertices. For G' rfmax ) , we can use only such paths, so G^ rfmax ) = G. Observe the 
similarity of the graphs GW to the graphs Gj in the definition of elimination orderings: in 
both cases the connectivity through some vertices is redirected to hold immediately between 
those vertices thus connected. The main differences are granularity (the number of graphs 
is d max + 1, respectively \V\ + 1) and the fact that in the definition of GW, no vertices are 
deleted. Indeed, we have GW[(J Xj] = G\ \\ x-\- This also explains our interest in the 

G^ 1 '. We are (implicitly) looking for elimination orderings for G which respect d. Such 
elimination orderings keep the Xi intact, so the G^ 1 ' will (up to vertex deletion) occur as 
the Gj at the boundaries between the different Xi. 

Definition 4.3. The component tree of (G, d) is a rooted tree with nodes t labelled by two 
subsets of V, denoted by Ct and D t . For the root r, we let C r := V. For a node t at level 
i (where the root has level 0), we let D t := Ct n {Jj <{ Xj. For each (nonempty) connected 
component C of G[Ct \ D t ], node t has a child tc and we let Ct c := C U Ng(C). Further, 
we let B\ := D t n ^ and D 2 t := D t \ D\. 

Lemma 4.4. Let G be a graph and d: V(G) — > N. Let t,u be nodes of the component tree 
of (G,d), where u is a child oft. Then 

(1) D t nD u = D t nC u . 

(2) Let (T, B) be a d-stratified tree decomposition of G. Then some piece of (T, B) covers 
D t C\D u . 

Proof. Let i be the depth of t in the component tree and let C be the connected component 
of G[C t \ D t ] such that C u = C U N G [C). 

1) It suffices to prove D t n D u D D t n G u . If x G A n C u , then x 6 D t C Uo<i<i ^i 
and hence x G G„ n Uo<j<i+i ^i = ^«- 

2) By Fact 12.21 it suffices to show that any pair of distinct vertices x, y E Dt n -D« occurs 
together in some piece of (T,B). As x,y E Dj, we have x,y G, so x, y E Nq(C). Since 
G is a connected component, there is a path P from x to y with all internal vertices in G. 
Hence <i(z) > d{x) and d(^) > <i(y) for all internal vertices z of P. 

Let x = xo, xi, . . . , x n = y be the path P. If n = 1, then G already contains the edge 
{x,y}, which hence is covered by (T,B). Otherwise, the set of tree nodes covering vertices 
from P \ {x, y} induces a nonempty connected subtree T" in T. Let s be the <r-minimal 
node of T' and let 1 < i < n be such that s = t Xi . As {x, xi} is an edge of G, the vertex x 
is covered by some node t' of T". By definition of s and t^, we have s <t f and t x <t t'. 
Hence, both t x and s lie on the unique path from the root of T to t' and thus they are 
comparable by <t- As t Xi = s <t t x would contradict <i-stratifiedness, t x <t s <t t' 
follows. Using (TD3), we conclude that s covers x. Analogously, s covers y, so x and y 
occur together in B s . □ 

While the component tree is useful for mentally addressing the task of computing a 
stratified tree decomposition, we cannot afford to actually compute it: its size is superlinear 
and we want to achieve a linear running time. The algorithm instead works with a modified 
variant. First, we can do without the GtS, so the algorithm only computes the tree itself 
and the Djs. This projection is necessary to obtain a linear size, as is the following second 
modification: in case for some node t at level i > we have Dt D Xi = 0, then we omit the 
node t, making its only child instead a child of the parent of t. We term t a dropped node. 
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Whenever we talk about the level of a node in the modified component tree, we always refer 
to the level the node had prior to dropping any nodes. As a third modification, for each 
node t, say at level i, we store G^ _1 )[-Dj] alongside Dt. For technical reasons which will 
become clear in the proof of Lemma \A.5\ we allow the encodings of these G^~ l >[Dt\ to have 
multiedges. To keep the notation simple, let us fix the convention that \E\ for some edge 
set E with multiedges denotes the sum of multiplicities of edges from E. 

Lemma 4.5. We can compute a modified component tree of a normalized (G, d) in time 

0(\V\-tw(G,d) 2 ). 

Proof. Let k := tw(G, d). The modified component tree is computed in a bottom-up fashion. 
For this observe, that the various C used in the definition of the component tree at level 
i are the connected components of G[|J Xj]. Hence each Ct for t at level i is C U Ng(C) 

for such a C. In order to determine the corresponding Dt, we do not need the full graph: 
it is irrelevant which edges between vertices of (J Xj are present; it suffices to know what 

j>i 

connectivity these induce on (J Xj . This information is present in G^> . More precisely, we 

j<i 

can transform the defining equality Dt = (G U Ng(C)) H [J Xj, where C is some connected 

j<i 

component of G[{J Xj], into the equality D t = D U (N G (i)(D) n [J Xj), where D is the 

j>i j<i 

corresponding connected component of GWfXj]. 

Recall that, as {G, d) is normalized, the vertices of G are 1, . . . , n, and d max < n, where 
n = \V\. More precisely, we assume that G is given as an array of adjacency lists and that 
d is given as an array of efs values, where both arrays are indexed with vertices. Using 
bucket sort, we compute the sets Xq, . . . , A^ max in time 0{n + <i max ), which is 0(n) thanks 
to (i max < n. 

For the bottom- up run, the algorithm uses a loop i = d max , . . . , 1. As an invariant, at 
the beginning of run i it has the following data: 

• The graph G^> (possibly with multiedges). 

• An array of lists of subtrees of the modified component tree. Overall, the lists contain 
all subtrees rooted at level i + 1. For all < j < d max , the list at entry j contains the 
subtrees which, after dropping nodes, end up at level j. 

For j > 0, we also store an element from Dt n Xj^\ alongside each subtree, where t is 
the root of the subtree. 

The graph is initialized to G'( c ' max ) = G. This can be done in zero time, because G is not 
needed any more. The array is initialized with empty lists, using time 0(d max ) and thus 
0(n). For a single run of the loop, we first do a depth-first search through G^'[Xi\. More 
precisely, the search is in G^ 1 ' , it uses all elements from Xi as entry points, and it terminates 
recursion in elements from other Xj. In this way, the connected components D of G^pQ] 
are found, and we also directly obtain the corresponding DL)(N G (i) (D)C\ (J Xj), that is some 

j<i 

new Df. During the search, we do some more things, which only increase the running time 
by a constant factor: for each such Dt, we generate the tree node t and we label each vertex 
of Dt with t. The label persists only until the next i. Also, for each such t, while constructing 
D t , we also compute the sets D\ and Df, and we start constructing G^ -1 ^/^] by building 
a graph with vertex set Dt and all edges from G'^ [Dt] incident to (at least one vertex of) 
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D\. Furthermore, we determine the maximal j, such that D 2 n Xj 7^ and we remember 

some x € D 2 n Xj. The running time of the search is 0(\Xi\ + \{e € E(G^)\e nlj/ 0}|). 

The second part, the number (respecting multiplicities) of edges of G^ 1 ' incident with Xi, 

is bounded by the number of edges of G^ ' incident with Xj. Summing over all i, the total 

running time of all searches is then 0(n + \E(G^-°')\). 

Next, we update G® to G^" 1 ' by introducing, for each new t, edges between each 

two vertices from D 2 . We also do this in the graph stored at node t, which thus becomes 

G( l_1 ) [Dt] as needed. We do not check whether the edges were already present (because we 

do not have enough time to do so), hence multiedges may be introduced. The running time 

is 0(|Z?t| + £), where £ is the number of edges introduced. Summing over all iterations, the 

first part \Dt\ is bounded in the same way as the running time of the depth-first searches. 

The second part £ is bounded by ( 2 ), where k := tw(G,d) + 1: as D t n |J Xj = D t n D u 

j<i 

for the parent u of t, we can conclude \D 2 \ = \Dt n |J Xj\ < k from Lemma 14.41 2. Hence 

j<i 

in total, the second part induces a running time of 0(m ■ k 2 ), where m is the size of the 
tree. As this is the only place, where we add edges to G^\ we can also bound |i£(Cr( ))| by 
\E(G)\+m-( k 2 ). 

We already have all the nodes for level i. We only need to connect them to the nodes 
of level % + 1 (in case i < d max ). For this, for each tree in the list at the array entry i + 1, 
say with root u, we look at its element from D u n Xj, say y, then look at the label of y 
generated above (as y € Xj, we did consider y in the depth- first search), say t, and make 
u a child of t. As this is done at most once for each tree node, the total running time of 
doing this throughout the loop is linear in the size of the tree. Last, for all new t, we recall 
j and x determined above, and we add the tree rooted at t together with x to the list at 
array entry j + 1. 

This concludes the description of the loop. After it has finished, we generate the root 
node r, set D r := Xq and make all trees in the list at array entry children of r. The total 
running time of computing the modified component tree is thus 0(n + \E(G)\ + m ■ k 2 ), 
where, again, m is the number of nodes of the modified component tree. For the second 
term we have |i?(G)| < n-tw(G) < n-k by Fact 12. ll Now, let us estimate m. Each non-root 
node t of the modified component tree is a non-root node of the original component tree, 
say at level i > 0, satisfying D t n Xj 7^ 0. Hence, there is an element xt € D t n Xj. Let 
u be the parent of t in the original component tree. Then xt belongs to the connected 
component C of C u \ D u , such that Cj = C U Nq(C). For different t at level i, the xt come 
from different connected components, so there can be at most |Xj| such t. In total, m < n, 
so the running time is 0(n ■ k 2 ). □ 

Remark 4.6. As a corollary to the previous proof, let us observe that G^> is obtained 
from G by turning all D 2 into cliques. 

Theorem 4.7. There is an algorithm that, given a normalized (G,d), computes a d- 
stratified tree decomposition of G of minimum width in time 0(|V(G)| • 2 P y( - tw ' ' •*'). 

Proof. Observe that, if u is a child of t in the component tree, then Dt n D u = D 2 . Then 
Lemma 14.41 2 and Remark 14.61 imply that d-stratified tree decompositions of G coincide 
with those of G^°\ Hence it suffices to compute a d-stratified tree decomposition for G^ ' 
of minimum width. This is achieved in four phases. First, a modified component tree is 
computed as in Lemma 14.51 
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In the second phase, for each node t of the component tree, an optimal tree decompos- 
ition of G(°'[Dt] is computed. Observe, that, if i is the level of t, then G^°'[Dt] coincides 
with G^ 1 '[Dt], which we have stored at node t. Hence, for a single t, using Bodlaender's 
algorithm [6] (which works well even in the presence of multiedges), the tree decomposition 
can be obtained in time ||G( i-1 )[A]|| ■ 2 pol y( fe ). Observe, that the sum of all WG^'^ [D t }\\ 
is bounded by the encoding size of the modified component tree, which is bounded by the 
time it took to compute it. Hence, overall this phase takes time n ■ 2 poly ( fc ). 

In the third phase, for each node t of the component tree and each parent or child u of 
t, we determine a node xt jU of the tree decomposition of G^- ' [Df], whose piece fully contains 
Dt n D u . Observe, that Dt n D u coincides with D\ or with D^, depending on whether u is 
a parent or a child of t. Hence, Dt n D u is a clique in G^°\ and then also in G^°'[Dt], so 
there must be some xt, u as above. We find xt, u by first reading D\ respectively D\ from 
the modified component tree, and then checking each element from each piece of the tree 
decomposition against this set Dt n D u , until some adequate xt tU is found. Even without 
sophisticated search structures, 0(£ ■ \D t H D u \ < £ ■ k) time suffices, where £ is the size of 
the tree decomposition. Overall, this phase takes time 0(n ■ k s ). 

In the fourth phase, we use a bottom-up recursion to construct, for all nodes t, an op- 
timal tree decompositions of G^ ' [Ct\. It will happen to extend the above tree decomposition 
of G^°'[Dt\. The recursion works as follows: we cycle through the children of t. By virtue 
of bottom-up-ness, for each such child u, we already know an optimal tree decomposition of 
G^ ' [C u ]. As it contains the known decomposition of G^ ' [D u ], it also contains the node x u t- 
Hence we connect it to the known decomposition of G^ ' [Dt] by adding an edge between x u ,t 
and xt <u - The result still is a tree decomposition, because the intersection D t riD u separates 
Dt from C u (Lemma 14.41 !)• It is an optimal tree decomposition, because its width is the 
maximum of the widths of the participating tree decompositions, which were optimal for 
their respective subgraphs. In the end, the recursion yields an optimal tree decomposition 
(T, B) of G(°> [C r ] = G(°). It takes time 0{m). 

Last, pick some node s from the tree decomposition of G^°'[D r ]. As B s C D r C 
Xq, choosing s as the root of (T,B) makes it (i-stratified. This concludes the proof of 
Theorem 14.71 □ 

As each (G,d) can be normalized in time 0(n log n), a similar statement holds for 
arbitrary (G,d), albeit not with linear running time. Theorem 14.71 implies that we can 
efficiently decide whether a formula (p satisfies fotw(yj) < k: 

Corollary 4.8. There is an algorithm that, given a formula (p G C, computes a tree de- 
composition of ip of minimum width in time poly(||y||) + |var(y>)| • 2P ol y( fotw w)) . 

Proof. Given 93, it is first turned into a straight formula. Then, G v and ead^, are com- 
puted and normalized in polynomial time. The last step is a call to the algorithm from 
Theorem 14.71 □ 



5. Query evaluation on bounded first order tree-width 

This section contains the second main result: evaluating formulae of bounded tree-width 
is fixed parameter tractable with parameter the length of the formula. Moreover, we show 
that evaluating quantified constraint formulae of bounded first order tree-width can be 
done in polynomial time. This is stronger than Chen and Dalmau's result |9j for quantified 
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constraint formulae of bounded elimination-width: as we will see in Section [6l bounded 
elimination-width (i.e. bounded tree-width in [9]) implies bounded first order tree- width, 
but there are classes of quantified constraint formulae with unbounded elimination-width, 
that have bounded first order tree-width. 

Definition 5.1. Let k > be an integer. The fragment C k of £ consists of those (not 
necessarily straight) formulae <p such that |var((/?)| < k. In contrast, let C k be the fragment 
of formulae <p of C such that all subformulae of ip have at most k free variables. 

Remark 5.2. Obviously, C k C C k . On the other hand, formulae from C k can be turned 
into equivalent formulae from £ by renaming bound variables. The algorithm runs in 
polynomial time, and in linear time for fixed k. □ 

By Remark 15.21 we can use C k and C k interchangeably. It will be more convenient to 
work with the latter. 

Remark 5.3. For all k, the question whether a given formula is equivalent to an C formula 
is undecidable. 

The statement is folklore, but we provide a proof for completeness' sake. 

Proof. We reduce from satisfiability of C. By introducing a new unary predicate U and re- 
lativizing all quantifiers to U, satisfiability reduces to the question, whether a given formula 
from C is satisfiable by an infinite structure. By the theorem of Lowenheim and Skolem, 
this in turn is equivalent to satisfiability by a countable infinite structure. Now let 92 be 
some fixed formula such that <p does not have finite models, and such that p is not equi- 
valent to any £ -formula. Then a given formula ip is unsatisfiable by infinite structures, if 
and only if ip A ip' is equivalent to an C formula, where ip' is obtained from ip by renaming 
all symbols (constant symbols, relation symbols, and free variables) to be disjoint from all 
symbols of ip. For the 'only if part, unsatisfiability by infinite structures of ip implies the 
same for ip' . Then, ip A ip' is unsatisfiable and hence equivalent to Pc A —>Pc from C°. For 
the 'if part, assume that ip is satisfiable and that tp /\ip' is equivalent to x £ £ k ■ We obtain 
x' from x by renaming all symbols which do not occur in p> to be disjoint from all symbols 
of ip'. By choice of x we have x N '■P an d then x' N f- The converse would contradict 
that ip is not equivalent to any C -formula, so there is some interpretation T\ such that 
T\ \= <p A —>x'- As ip does not have any finite models, the domain of X\ is infinite, and by 
Lowenheim-Skolem we can assume without loss of generality that it is countable. On the 
other hand, satisfiability of ip implies that of ip' , say by an interpretation Z2 with countable 
infinite domain. By virtue of all the renaming done above, T\ and I2 do not have any 
symbols in common. Furthermore, their domains are of the same cardinality. Hence there 
is an interpretation X which extends both (up to isomorphisms) , that is we have I \= pAip' 
but I \/= Xi a contradiction to the assumed equivalence. □ 

As a consequence, there is no computable width parameter such that width-/c captures 
all formulae logically equivalent to a formula of C k . For any width parameter, only for- 
mulae which are 'syntactically close' to a formula of C are captured, for varying values of 
'syntactically close'. 

The following example shows that first order tree-width indeed does not capture equi- 
valence to C? . 

Example 5.4. Let n > 2. cp n = ip n V Xi where x € £ 2 an d fotw(^ n ) = n, but ip n is 
unsatisfiable. Then ip n = x £ £ 2 , but fotw(c^ n ) = n. 
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First order sentences of tree- width at most k — 1 have the same expressive power as C 
(and by Remark 15.21 hence as C ). More generally we have the following. 

Theorem 5.5. Let k > 0. 

(1) For any formula ip with fotw(yj) < k — 1 i/iere is a formula ip G C with ip = -0 which 
is computable from <p. 

(2) ^4ny formula ip £ £ k satisfies fotw^) < A; — 1. 

Proof. 2. By Theorem 13.251 it suffices to show that tw(G^,ad^,) < A; — 1. Without loss 
of generality, ip is already straight. Let T be the syntax tree of ip. For a node £, say t 
corresponds to the subformula x °f ^Pi let &t consist of the free variables of X- As ip E £ , 
the width of (T,B) is at most k — 1. (TD2) holds for (T,B), because edges arise from 
atoms and atoms are among the subformulae x considered in the definition of (T, B). Then 
(TD1) follows, because ip is straight. (TD3) holds, because for all variables x, the set 
{t S T | x S Bf} union of all paths from (nodes corresponding to) atoms in which x occurs 
to (the node corresponding to) the scope of x. It is easy to see that (T, B) is ad^,-stratified. 

1. Given ip, we use the algorithm of Corollary 14.81 to compute an ead^-stratified tree 
decomposition (T,B) of width fotw(<^) < k — 1 for G^. 

Intuitively, we will iteratively replace subformulae of <p by equivalent £ fe -formulae, until 
we obtain an C k formula ip equivalent to ip. The replacement is done along a tree decom- 
position (T, B) of <p> of width at most k — 1. For our purposes it is more convenient only to 
work in leaves of the decomposition, so in every iteration we restrict the decomposition to 
the part of the formula that still has to be transformed into an C k formula. In doing so, we 
treat the subformulae p' of ip already in C as new atoms, defined on the variables free(</? / ). 
For this we make sure that these new atoms are covered in some piece of the remaining part 
of the tree decomposition. 

More precisely, we describe an iterative algorithm. In every iteration, we are given a 
formula <p' in xenerp normal form, a tree decomposition (T',B') of p>' of width at most 
A; — 1, and a second order substitution S, substituting relation symbols of ipt by C k formulae 
with the appropriate number of free variables, such that we have tp'S = (p. We will need to 
extend the syntax of formulae: we allow any monotone Boolean function as a single Boolean 
connective. Of course, this extended syntax still allows for negation normal form, and we 
still assume that all formulae are in this normal form. This syntax extension carries over 
to the definition of (p\x] and hence of ead^. 

We start by letting cp' be some xenerp normal form of p, (T' , B') = (T, B), and S = 0. 
Now in every step we do the following. 

1. If ip' is quantifier free, then var(<//) = free (9?). As |free(y>)| < iotw(ip) + 1 < A; (recall, 
that free(<^) forms a clique in G v ), we have <p' € C k , so the algorithm stops with output 
ip'S. More precisely, <p'S may still use our additional Boolean connectives, but it is trivial 
to eliminate these without leaving C (but in general making the formula non-straight). 

2. Otherwise, if (T', B') has a leaf £ with parent t satisfying Bi C Bt, then we remove 
£ and Bi from (T',B'), keeping ip' and S. 

3. If neither 1 nor 2 apply, then we choose a variable x G var(y/) as in the following 
claim. 

Claim 1. There exists a bound variable x € var(<//) with ead ¥ ,/(x) maximum, such that x 
appears in exactly one piece B' £ of (T', B'), and £ is a leaf. 
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Proof. This claim is a variant of Claim Q] in the proof of Theorem 13, 11 and in can be shown 
in the same way. Instead of smallness of the decomposition, we use the fact that Case 2 
does not apply. Thus we obtain a variable x with ead^' (x) maximum and a leaf £ such that 
x appears only in Bg. As Case 1 does not apply, there are bound variables. As ead ¥ ,/(x) is 
maximum, x is a bound variable. ■ 

Without loss of generality, suppose that Q x = 3. Let ip be the scope of x in cp' and let 
V be the set of variables quantified in ip. We partition V into the subsets V\ and V2, where 
V\ contains the variables with which x is entangled in ip' . As <p' is xenerp, ip is <p' x preceded 
by some quantifications of variables from V\ . By maximality of ead^/ (x) , we have Q y = 3 
for all y 6 V\. 

Claim 2. No variable y 6 V\ is quantified in the scope of some variable z S V2. 

Proof. Otherwise, x occurs in p' y (because x and y are entangled), ip' y is a subformula of ip' z 
(because tp' is xenerp and z < v i y), and z occurs in tp' z which is a subformula of <p' x . Hence 
x and z are entangled, contradicting z € Vi- Thus the claim holds. ■ 

Claim 3. x is ^w-maximal. 

Proof. Otherwise there is some j/7^1 such that x -<^i y. In the inductive definition of ;^w, 
the pair (x, y) is not introduced by Reflexivity. If it is introduced by Transitivity, say with 
intermediate variable z, we can replace y by z, so we can eventually assume that x ^w y 
is due to Alternation. In particular Q x ^ Q y . Thus ead v ,'(y) > ead (y0 '(x), contradicting 
ead^/ -maximality of x. ■ 

In the same way we can show that there are no ^/-relationships among V\. 

Let ip 1 be obtained from ip by removing all quantifications for variables from V\. Letting 
3Vi denote the sequence of all these quantifications in arbitrary order, 3V\ip' is obtained 
from ip by a sequence of replacements as in Lemma 13.201 parts 1 and 2. In particular, z\xip 
is equivalent to 3x3V\ip' . 

As all variables quantified in ip' are from V2 and their scopes in ip' are the same as 
in ip, we have that x does not occur in any quantified subformula of ip'. Hence ip' is a 
positive Boolean combination of atoms, of negated atoms, and of subformulae in which x 
does not occur. By choosing the subformulae in which x does not occur maximal, ip' is a 
positive Boolean combination of atoms using x, of negated atoms using x, and of maximal 
subformulae of ip' in which x does not occur. We transform this Boolean combination into 
disjunctive normal form Vie/ Aigj ^j w ith literals Lj. Here each Lj is an atom using x, a 
negated atom using x, or a maximal subformula of ip' in which x does not occur. Then 

3xip = 3x3V 1 ip' = 3x3Vi \J f\ Lj = 3V t \J 3x f\ Lj. 

ieijeJi iei je.J t 

For i 6 I let Jpp C Jj be the subset of indices j such that x does not occur in Lj and let 

Jj + = J-i\ Jpp ■ For convenience we let J + := |J J t + and J~ := |J J^. Now 

iei iei 

3xiP^3V 1 \j3x l\L ] ^3V 1 \J{ /\ Lj A3* /\ %). 

is/ jeJ, ie/ jgj- j G j+ 

Let j £ J + . As (T', £>') is a tree decomposition of 99', (the atom underlying) Lj is covered by 
some piece of (T" , £>'). Since x is a variable of Lj and x occurs in i^ only, Lj is covered in 1?^. 
Hence for i E/ and ipf := 3x /\- £j + Lj we have var(ipf ) C .B^. Let xy be an enumeration 
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of Bg, and for all i £ I let Af be a new relation symbol of arity \Bg\ — 1 < k — 1. Replace 
3a;^> in V 7 ' by 3Vi^", where 

r--=\/{ hLjAAfy) 

and let 92" be the formula thus obtained from 99'. Let <S X be the substitution which replaces 
every Afy by ipf, respectively. Clearly, 92' = cp"S x , so by setting S" := S X S we have 
93 = <p' S = ip" S' . More precisely, in the definition of ip", we use a single positive Boolean 
connective (as allowed by the above syntax extension) for the entire disjunctive normal 
form. This Boolean connective has only one input for each Lj used. In particular, we retain 
a single quantifier per variable quantified in ip, that is we retain straightness. 

We obtain a tree decomposition (T", B") for 92" by letting T" := T' and removing x 
from B' t This tree decomposition still covers all (edges of G^n created by) atoms of 92" 
which also occur in 92', because these do not use x. The new atoms Afy are covered by 
B'l = B' t \ {x}. This shows (TD2). (TD1) and (TD3) are inherited from (T',B'). 

In the rest of the proof we show that (T", B") is ead<n» -stratified. 

Claim 4. Let y be a bound variable and Z a set of bound variables such that x =^ y and 
x Z. Then y occurs in 92^, if and only if y occurs in (p', z , or 92^, is a subformula of (p' x 

but not of any Lj with j € J~ and y occurs in <p' x . 

Proof. Observe that <p', z , is generated already by at most two occurrences of variables from 
Z. If one of these occurences is outside of ip, then f\ z \ is disjoint from ip and ip' x , or ip and 
(p' x are proper subformulae of <p', z , . In either case we have to show that the same variables 
y ^ x occur in ipt.g, as in <p'! z y This follows, because in the first case ip', z , = ip'/ z , whereas in 
the second case (p'/ z , is obtained from 99^, by replacing the subformula 3xip with 3V\ip". It 
remains to consider the case where all (generating) occurrences of variables from Z in ip' are 
in ip. Let j±, J2 be such that the generating occurrences are in Lj 1 and Lj 2 . If j\ = j'2 € J~, 
then ipfrg, is a subformula of Lj 1 . Furthermore ip'/™ = ip', z i , so the same variables y occur in 
these formulae. If any generating variable is from B'^, then it occurs in all atoms Afy, so 
(p'/yi = ip" ■ In the last case, ji,J2 € J~ and j\ 7^ J2- Again, tp'/y, = ip" . In both cases there 
is no single j G J~ such that <p', z , is a subformula of Lj , so we need to show that the same 
variables y occur in <p'/ z , as in <p' x (including the special case of occurrence in the subformula 
(p', z , of cp' x ). This follows because (p'/ z , = ip" and ip" uses the same variables as <p' x . This 
shows the claim. ■ 

For arbitrary variables y, z different from x, the two following claims show that y ^„/ z 
if and only if y ^n z. 

Claim 5. Let y, z be variables different from x. Then y ^„/ z implies y ^» z. 

Proof. First, consider the following modification of ^/. Relax, in the definition of entan- 
glement with respect to < and cp' , the conditions of the form l v occurs in <p>', < ,' by l v occurs 
in <p\ ^ or in Vrf <i\/ u'i with the convention that no variable occurs in cp'L. Let ^ denote 
the modified relation. It is clear that y ^ v > z implies y ■< z. By induction on the definition 
of ^ it is easy to see that x also is ^-maximal. 

In order to transform derivations for ^ into derivations for ;fw', we start by normal- 
izing them. First, we may assume that no entanglement chain used in any application of 
Alternation repeats elements. Also, we may assume that Reflexivity and Transitivity are 



24 I. ADLER AND M. WEYER 



always applied as early as possible. In particular, at each application of Alternation, the 
current approximation for X is reflexive and transitive. The third normalization concerns 
applications of Alternation, say with chain vq, . . . ,v n and with < as the current approx- 
imation of X. Such an application is normalized, if for entanglement, instead of the sets 
Vi <, already the sets (v{ <) \ {x} suffice (unless t>i = x, in which case (vi <) = {x} due to 
maximality of x). Now we claim that this normalization is always possible. We prove the 
claim by induction on derivations for ■< which are already in the first two normal forms. 
Only the Alternation step is nontrivial, so consider an application of Alternation with chain 
vq, . . . ,v n and approximation < for ^. By the inductive hypothesis, all pairs from < can 
be derived in a normalized way. If our application of Alternation is not already normalized, 
then there is some < i < n such that x ^ v i < x. After unfolding Transitivity, we obtain 
some w such that Vi < w and such that w < x holds due to Alternation (possibly w = vi). 
Let (<') C (<) be the approximation of ^ pertaining to this application of Alternation. 
Then there is another chain of entanglements ending with u, x, such that w <!' u or x <' u. 
Due to the normalizations from the inductive hypothesis, x already occurs in t p , \( u< i)\i x y\ or 
in y|/ u< /\\ (-„-).]■ As x does not occur in ip" at all, the former must be the case. As x <!' u 
would contradict ^-maximality of x, we have w <' u. Transitivity gives Vj < u. Now 
consider one of the up to two neighbours of v% in the entanglement chain, without loss of 
generality we pick Vi + \. We distinguish the cases Vi + \ = x and vi + \ ^ x. In the first case, 
we already know that x occurs in <pi, <i i\\f x \]- From v- h < u and transitivity we conclude 
(u <') \ {x} C (uj <) \ {x}, so x occurs in p'\( v -<i)\i x \] as needed. Now for the case that 
«i_l_i ^ x. As nj + i occurs in yL.^ or in yrL.<\\ r x \i, it is easy to see that Vi + \ also occurs 

in tfu v .<i\\i x \\js u \y But as V{ < u, this is V'f'/v^'iUxir ^ n a ^ as * normanza ti° n step, we also 
eliminate x from occurrences in entanglement chains in derivations. Hence assume that 
v, x, w is part of such a chain, again with < as approximation of ■<. As x is ^-maximal, we 
have (x <) = {x}, hence u occurs in <p\t x -\-\ and then also in ip" . As x occurs in ( f^i( w< )\r x v 
(recall the previous normalization), we conclude that ip" is a subformula of <p\'( w< \\ r 11 • Thus 
u occurs in ( / 7 f( u ,<i)\{ x >i an d similarly w occurs in 9 3 |(„<)\{ a; \i; so x can De omitted from the 
chain. After this normalization, we can assume that x occurs in applications of Alternation 
only as an endpoint of the chain. 

Now assume some counterexample y, z to the claim. Then y -< z. Let y, z be derivation- 
minimal with respect to normalized derivations for ^. Then y ^ z because ^ </3 » is reflexive. 
Next assume that y ^ z is due to transitivity, say with intermediate variable v. If v = x 
then x < z ^ x contradicting ^-maximality of x. Thus v ^ x, so y -< v » v ^^» z by 
minimality of the counterexample, y ^^" z follows. Finally for Alternation, let us assume 
some chain y = vq, ■ ■ ■ , v n = z of entanglements in the sense of ^, such that y -< V{ or z -< Vi 
for all < i < n. As always, denote the approximation of ^ by <. By normalization of 
derivations, no Vi equals x and x is not needed for entanglement. From derivation- minimality 
we conclude y ^n Vi or z ^^» Vi, and that (vi <) \ {x} C (v% d^p") for all < i < n. By 
Claim HI the occurrence of v%+\ in $\< v .<£\\i x \\ or { P'\(v<£\\s,x\\ i m P nes occurrence of v%+\ in 
ip'L.^ l- Similarly, Vi occurs in cp 1 / ^ ,. Thus, the entanglement chain is also such a one 

in ip". Also y < v > z implies y < v » z so we conclude y < v ii z. ■ 

Now for the converse. 

Claim 6. Let y, z be variables different from x. Then y ^„« z implies y ;fw z. 
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Proof. We work by induction on ^w/. The cases Reflexivity and Transitivity are immediate. 
For Alternation let < be some approximation of rfw', assume y <„» z and Q y 7^ Q z and 
let y = wq, . . . , w n = z be given such that for all < i < n we have y <^n Wi or z < v ii Wi, 
occurrence of Wi in (p',' w . <,,, and occurrence of Wi + \ in <p'\ w .<i\- The inductive hypotheses 
are y ^^/ ioj or 2; < v i W{, and (<) C (^^'). As <^/ and <^» differ only among Vi and we 
have Qy 7^ Q z , we conclude y <«/ 2. If furthermore all occurrences also hold in </?', then 
we are done. So assume that there is some % such that Wi is not entangled with Wj+i in <//, 
that is iuj does not occur in f' Wi+1 or tUj+i does not occur in (p' w . . Without loss of generality 
let us assume the former. Claim [4] then implies that Wi occurs in <p' x and that <p' Wi+1 is a 
subformula of ip' x but not of any Lj with j € J~ . By maximality of these Lj among the 
subformulae not containing x, this implies that x occurs in tp' Wi+l - Of course Wj+i occurs 
in (/'wi+i an d thus in ^. Hence x and iOj+i are entangled in ip? . Next let us show that also 
wi and x are entangled. If tUi+i does not occur in (p' w . , then we can proceed as above, so 
we may assume that Wi+\ does occur in ip' w .. Then wi and Wi+i are comparable by <<y. If 
Wi+i <<pi Wi, then xenerp normal form of if 1 implies that Wi occurs in <p' w . in contradiction 
to our assumption. Hence Wi <^i Wi + \ and by xenerp we have that <f>' Wi , 1 is a subformula of 
f>' w .- In particular x occurs in tp' w . and we already know the converse. In this way, whenever 
a link of the ip" '-chain does not hold in ip' , we can insert x to obtain a longer valid chain. 
So far, validity only means that each link is an entanglement in ip' . 

It remains to show that y ^/ x or z ^^ x, if x needed to be inserted. Let v and 
v' be the neighbours of (one occurence of) x in the chain. If v and v' both are in {y,z}, 
then, because x is only inserted between non-entangled variables, one of them is y and the 
other one is z. As Q y 7^ Q z , we have Q v 7^ Q x or Q v i 7^ Q x , without loss of generality the 
former. As x and v are entangled in <// by the above, they are comparable by < v >. Thus 
by Alternation we either have x ^w v, contradicting ^w -maximality of x, or v ;fw x, in 
which case we are done because v £ {y,z}. Now for the case that one of t> and v' is not 
from {y, z}, without loss of generality y ^ v ^ z. Still, v is entangled with x. Furthermore, 
v is an original element of the chain, so y ^O uorz ^w v. Without loss of generality let 
us assume the latter and recall that z/d, After unfolding, in the derivation of z ■^ up l v, 
some applications of Transitivity, we obtain some u such that z ^w u ^w v and u ^<y/ v is 
due to Alternation. Accordingly, let u = to, . . . , £& = ubea witnessing entanglement chain. 
If v <„' £, then we can extend this chain by x to obtain (together with the fact u ^<v v) a 
witnessing chain for u -<^> x which implies z ^^ x using Transitivity. It remains to consider 
the case v £ v i x. But as v,x are entangled, they are comparable by < v i, so x < ip i v and 
thus v G V\ implying Q v = Q x and then Q u 7^ Q x . As u -< v > v we have u < v > v, so x 
and u are comparable by < v i. x <^/ u would imply u 6 V2 (since Q u 7^ Q x ), contradicting 
Claim [2j Hence u <„,' x so the chain to, . . . , t/~ can be extended by x to show u ^w x and 
then z <^i x. ■ 

Hence -<^i and ^<y,» coincide outside of x. As x is ^/-maximal, this implies that 
ead^/ and ead^" coincide outside of x. Hence ead^'-stratification of (T 1 , B') implies ead^,"- 
stratification of (T",B"). Finally, we turn ip" into xenerp normal form. 

It is easy to see that the algorithm terminates: the second case decreases the size of 
the tree decomposition while it leaves the set of variables intact. The third case leaves the 
tree intact and eliminates one variable. As both the tree and the set of variables are finite, 
eventually the first case must trigger and the algorithm stops. This concludes the proof of 
Theorem 15.51 □ 
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It is well-known that first order query evaluation for C can be done in time n + ' 1 \ 
see HH1. 



Corollary 5.6. Evaluating queries of bounded first order tree-width is fixed parameter tract- 
able with parameter the length of the formula. 

More precisely, given a finite structure A and a formula p € C, there is an algorithm 
that computes p(A) in time ||^|| fotw ( </3 - )+0 ( 1 )/(M) f or some computable function f . □ 

Here the function / is basically the running time we need for translating the formula p 
with fotw(y) < k — 1 into an C k formula. It is (/-times exponential, where q is the alternation 
depth of p. The exponentiations arise from converting some subformulae into disjunctive or 
conjunctive normal form. In special cases where this step is not needed, the running time 
is much lower. 

Corollary 5.7. Evaluating formulae without disjunctions and of bounded first order tree- 
width can be done in polynomial time. 

Proof. Let i/>bea formula without disjunctions such that fotw(p) < k. Let us recall the 
algorithm underlying the proof of Theorem 15.51 1. The argument for termination (at the 
very end of the proof) actually shows that the main loop is iterated only a quadratic number 
of times. Case 1 of the main loop is only executed once, and it is polynomial in the data then 
present. Case 2 basically requires a search through the current tree, which is a subtree of 
the original one. It remains to show that Case 3 runs in polynomial time and to bound the 
way it increases the size of the data. In fact, all individual steps of Case 3 run in polynomial 
time even for general p, with the exception of turning ip' into disjunctive normal form. But 
as our p does not contain any disjunctions, the same holds for p' and ip', so ip' already 
is in disjunctive normal form and there is nothing to do. Formally, the index set I only 
contains one element and (up to reordering) we have ip' = ip- A ip+, where ip- := /\ Lj 

jeJ- 
and ip + := /\ Lj. 
je.J+ 

Let us now bound the data increase. The data consist of the tree decomposition (which 
only becomes smaller), the formula p', and the substitution S. It is easier to consider, 
instead of p' and S, the formula p'S. The size difference between the two stems only from 
a collection of pairs (x,i), where x is a variable and i is an index from the respective /. 
As each such I is a singleton, there are at most |var(c/?)| such pairs. With respect to p'S, 
all that happens in Case 3 is shifting quantifiers and then replacing 3x3V\{ip- A ip+) by 
3Vi(ip- A 3xip + ). The size does not change. 

Hence the algorithm from Theorem [531 1 runs in polynomial time. So does turning the 
formula from C k into one from C k . We conclude with the time n k+ °^ l > for evaluating the 
latter. □ 

Of course, the same applies to formulae without conjunctions. One example of formulae 
without disjunctions are quantified constraint formulae, which we define and discuss in detail 
in Section [H 

The fixed parameter tractability from Corollary 15.61 does not remain when we make the 
first order tree-width part of the parameter. This is even true for model checking instead 
of evaluation. Instead, the problem becomes AW[*]-hard: fotw is bounded by the length 
of the formula, and the model checking problem for first order logic, parameterized by the 
formula length, is AW[*]-complete |12j . 
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6. Relation to similar notions 

In this section we show that fotw and tree-width coincide on conjunctive queries, while fotw 
is more powerful than both elimination-width of quantified constraint formulae and strict 
tree-width of non-recursive stratified datalog programs. Finally, we extend the cops and 
robber game characterizing tree-width to stratified tree-width and we prove that requiring 
monotonicity does not limit the cops. 

A (Boolean) conjunctive query is a sentence <p = 3x\ . . . 3x n i/j, where ip is a conjunction 
of relational atoms such that var(^) = {xi, . . . ,x n }. The tree-width of a conjunctive query 
<p, tw(ip), is defined as the tree-width of G v (see [25]). Any conjunctive query ip satisfies 
ead^, = 1. Hence the notion of fotw generalises the notion of tree width of conjunctive 
queries. 

Remark 6.1. Any conjunctive query tp satisfies fotw(^) = tw(ip). □ 

6.1. Quantified constraint formulae. A quantified constraint formula [9 J is a sentence 

ip = Q lXl Q 2 X 2 ■ ■ ■ Qn^nlp, 

where Qi G {V, 3} for i = 1 . . . n and ip is a conjunction of relational atoms. In [9], Chen 
and Dalmau introduce the notion of tree-width of a quantified constraint formula and they 
show that model checking for quantified constraint formulae of bounded tree-width can 
be done in polynomial time using the /c-consistency algorithm. Since their notion of tree- 
width is defined via an elimination ordering rather than via a decomposition, we call it 
elimination-width instead of tree-width. We show that fotw is less than or equal to Chen 
and Dalmau's elimination- width, and we give an example of a class of quantified constraint 
formulae having bounded fotw and unbounded elimination-width. By Corollary 15.71 model 
checking of quantified constraint formulae of bounded fotw can also be done in polynomial 
time. 

Recall that by Theorem 13-H any graph G and d : V(G) — > N satisfies tw(G, d) = 
ew(G,d). Let <p be a quantified constraint formula with formula graph G v . Let ad' be the 
mapping that assigns to a variable v € var(</?) the number of quantifier changes occurring 
before v in the quantifier prefix of ip, adding +1 (note that here we always add +1, regardless 
of the first quantifier of ip). Then Chen and Dalmau's notion of elimination-width can be 
equivalently reformulated in our setting as ew(G ¥> ,ad' ). 

Lemma 6.2. Let ip be a first order formula. Then fotw((/?) < ew(G ip ,ad'). 

Proof. By Theorem 13. 1[ we may use tw and ew interchangeably. Then, by Theorem 13. 25 ( it 
suffices to show tw(G ¥ ,,ad ((0 ) = tw(G v ,ad' v ). First, assume that p starts with a quantifier. 
If this quantifier is existential, then ad^, = ad' . If it is universal, then ad v (x) = ad' (x) = 
for all free variables x, and ad ((0 (x) — 1 = ad'^(x) > for all bound variables x. In both cases, 
an elimination ordering respects ad^, if, and only if, it respects ad'^. Hence ew(G tfi ,ad !fi ) = 
ew(G ¥ ,, ad' ) and thus tw(G ¥ ,, ad^) = tw(G v , ad' ). 

The same holds trivially, when <p is quantifier free. 

Now for the general case: <p is a positive boolean combination of formulae tpi, . . . ,<p n 
which are quantifier free or start with quantifiers. Let F be the set of free variables of ip. 
We already know that tw(G v , i ,ad ( ^J = tw(G ¥ , i ,ad' ) for all 1 < % < n. As the various pi 
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have pairwise distinct bound variables, G v — F is the disjoint union of the G Pi — F. Then 
it is easy to see that 



and 



tw(G ip ,ad ip ) = max(|F| - 1, max tw(G iPt ,ad ipi )) 

Ki<n 



tw(G tp ,ad' ) = max(|F — 1, max tw(G ipi ,ad'. )), 

Ki<n ' 



from which it follows that tw(G^, ad^) = tw(G^, ad^,). □ 

Remark 6.3. There exists a class of quantified constraint formulae having first order tree- 
width 1, where Chen and Dalmau's elimination-width is unbounded: let the class consist of 
the ip n as in Example l3.261 Then ad^ n = ad^ , and hence tw(G (/Pn , ad^, n ) = ew(G, Pn , ad^, n ) = 
n, while fotw(</? ra ) = 1. □ 

6.2. Non-recursive stratified datalog. In [16], Flum, Frick and Grohe define strict tree- 
width of non-recursive stratified datalog (nrsd) programs and they show that the evaluation 
problem for NRSD programs can be solved in polynomial time on programs of bounded strict 
tree- width |16l Corollary 5.26]. nrsd programs have the same expressive power as C and 
there are simple translations in both directions. This allows us to compare their notion with 
fotw. We show that if the NRSD program has tree-width at most k, then the corresponding 
first order formula has fotw at most k, and we exhibit a class of formulae with bounded 
fotw, whose corresponding nrsd programs have unbounded tree-width. 

We assume that the reader is familiar with datalog and we only fix our notation, and 
we refer the reader to [16] otherwise. A datalog rule p with negation is an expression 
Qx\ . . . xi <- Af=i ^«5 where Q is a relation symbol and xi, . . . ,x n E var(/\™ =1 A;) are pair- 
wise distinct variables, and the Aj are literals (i = 1, . . . , n). Qx\ ... x; is called the head of 
p, and AILi ^» ^ s caue d the body of p. To define the semantics, let A be a structure whose 
vocabulary contains all the relation symbols occurring in the body of p. Let y be a tuple that 
consists of all variables of var(/\™ =1 Aj) \ {x\, . . . , x{\, and let f p {x\ . . . x{) = By /\i <i<n \. 
We let p(A) := (p p (A). A non-recursive stratified datalog (nrsd) program is a sequence 
II = (IT 1 , . . . ,IT n ) of non-recursive datalog programs IP (called the strata of IT) as defined 
in [16]. We denote the intentional vocabulary of IT by int(II) and the extensional vocabulary 
of ITby ext(IT). 

The strict tree-width of a datalog rule p is defined as stw(p) := tw(GL,), and for an nrsd 
program IT = (IT , . . . ,IT n ) the strict tree-width of II is defined as stw(II) := max{stw( / o) | 
p G Ur=i n *}H Th e following is proved in [16], Corollary 5.26 (2). 

Theorem 6.4 (Flum, Frick, Grohe). For fixed integer k > 0, the evaluation problem for 
NRSD programs of strict tree-width at most k can be solved in polynomial time. □ 



In [16], the term strict tree- width refers to the fact that tree decompositions are required to cover all 
variables in the head of a datalog rule together in some piece. 
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It is well known that a query is definable in first order logic if and only if it is NRSD 
definable. Actually, an NRSD program IIq (i.e. an nrsd program with goal predicate Q) 
defines an equivalent first order formula ipn Q in a natural way, and vice versa. For proving 
our theorem, we make one direction explicit, associating a first order formula to an nrsd 
program as follows. 

Let II = (IT 1 , . . . , II n ) be an nrsd program and let Q £ int(IP) for some i < n. Suppose 
for all Q' £ int(II) occuring in (II , . . . , IP -1 ), the formula tpn n , is already defined. Let 

^n Q := V P> P , 

p£ll s , Q occurs in the head of p 

and let 

V?n Q := V/ f% 

pGlT, Q occurs in the head of p 

where ip* is obtained from <p p by recursively replacing relation symbols Q' £ int(il) occuring 
in (f p by the corresponding formula 'PUq, (he. <pu Q is an ext(II)-formula). Let A be an 
ext(II)-structure. It is easy to see that we have Hq(A) = ifYi Q (A). 

Recall that ad'^ is the mapping that assigns to a variable v £ var(ip) the alternation 
depth of v in ip. 

Theorem 6.5. Any NRSD program II with Q £ int(II) satisfies 
fotw(9Jn Q ) < tw(G ¥ , n ,ad / ) < stw(II). 

Proof. The first inequality follows from Theorem 13.11 and Lemma 16.21 

Towards the second inequality, let II = (II 1 , . . . , IP) and let k := stw(II) = max{stw(</? p ) 
P € UiLi ^i}- We P rove by induction on the number n of strata of II that all Q £ hit (II) 
satisfy tw(G ipn , ad^ n ) < k. Let Qx be the head corresponding to Q and let i be such 

that Q £ int(IP). Then (fiia b as exactly the free variables x and we have 

^n Q = V <p* p . 

pGll 1 , Q occurs in the head of p 

Suppose all Q' £ int(II) occuring in (II , . . . , IP -1 ) satisfy tw(ad' ¥ , n , G ipn ) < k. 

We may assume that ipu Q and (pn Q are straight. 

First we construct a tree decomposition for (pun as follows. For every p £ 11* with head 
Qx, we take a tree decomposition of width at most k of G Vp . Each of these decompositions 
has a piece containing the variables x. We glue them together at one new root covering 
the variables x. Then we orient the edges of the decomposition tree away from the root, 
and we obtain a tree decomposition (T, B) for <pn Q of width at most k. By the inductive 
hypothesis, for every Q' £ int(II) such that Q'y occurs in <pu Q , we have an ad^, n -stratified 

tree decomposition (T® , B® ) of <pii n i (y)- By definition, the variables in y are contained in 
the piece at the root r of T® . Moreover, y is covered in some piece of (T, B). We choose 
such a piece Bt and we attach (T® , B® ) to this piece such that r becomes a new successor 
of t. Having done this for all atoms Q'y (with Q ^ Q'), that occur in (pn Q , we obtain a tree 
decomposition (T, B) for G ipn of width at most k. We may assume that (pn Q is in negation 
normal form. (If not, we transform <p>n Q into negation normal form. Note that this does 
not change the formula graph.) Then, by construction, (T,B) is ad'^ n -stratified. □ 
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The following remark shows that the difference can be unbounded in the opposite 
direction. Moreover, it shows that the difference between Chen and Dalmau's elimination- 
width and tree-width of nrsd programs can be unbounded. 

Remark 6.6. There is a class C of nrsd programs with unbounded strict tree-width, such 
that fotw(</?n) = tw(G ipn ,ad' (pn ) = for all UeC. 

Proof. For an integer n > let ip n := 3x± . . . 3x n -iVa; n ( /\" =1 Pxj) . Take C to consist of the 
natural NRSD programs which are equivalent to the formulae ip n , for n > 0. □ 

6.3. Cops, Robbers and stratified tree- width. We now introduce the cops and robbers 
game as defined in |27| . Let G be a graph and let k > be an integer. The cops and robbers 
game on G (with game parameter k) is played by two players, the cop player and the robber 
player, on the graph G. The cop player controls k cops and the robber player controls the 
robber. Both the cops and the robber move on the vertices of G. Some of the cops move 
to at most k vertices and the robber stands on a vertex r not occupied by the cops. In 
each move, some of the cops fly in helicopters to at most k new vertices. During the flight, 
the robber sees which position the cops are approaching and before they land she quickly 
tries to escape by running arbitrarily fast along paths of G to a vertex r', not being allowed 
to run through a standing cop. Hence, if X C V(G) is the cops' first position, the robber 
stands on r € V(G) \ X, and after the flight, the cops occupy the set Y C V(G), then the 
robber can run to any vertex r' within the connected component of G \ (X D Y) containing 
r. The cops win if they land a cop via helicopter on the vertex occupied by the robber. The 
robber wins if she can always elude capture. Winning strategies are defined in the usual 
way. The cop-width of G, cw(G), is the minimum number of cops having a winning strategy 
on G. 

A winning strategy for the cops is monotone, if for all plays played according to the 
strategy, if Xi,X2, . . . is the sequence of cop positions, then the connected components Ri 
of G \ Xi containing the robber form a decreasing (with respect to C) sequence. The Ri 
are called the robber spaces. The monotone cop-width of G, mon-cw(G), is the minimum 
number of cops having a monotone winning strategy on G. 

Theorem 6.7 (Seymour, Thomas |27J). Any graph G satisfies tw(G) + 1 = cw(G) = 
mon-cw(G). □ 

Now let G be a graph and let d be a function d: V(G) — > N. The d-stratified cops 
and robbers game on G is played as the cops and robbers game on G, but in every move 
the cops have to satisfy the following additional condition. Intuitively, they can only clear 
vertices v with d{v) = i after they have cleared all vertices w with d(w) < i. More precisely: 
for every move (X, R), where X C V(G) is the cop position and R is the robber space, the 
cops have to make sure that max{d(x) | x G X} < min{d(r) \ r G R}. Then cw(G,d) and 
mon-cw(G, d) are defined analogously, and for a formula ip we let cw((p) := cw(G lfi ,ead lfi ) 
and mon-cw((/j) := mon-cw(G (jD ,ead ¥ ,). 

Although proving the following theorem is not very hard, it seems interesting to know 
that cw(G, d) and mon-cw(G, d) coincide. In many generalisations of the cops and robbers 
game to other settings, the analogous statements become false [HE1I26], and it might be 
helpful to explore the borderline. 
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Theorem 6.8. Let G be a graph and d : V(G) — > N. The following statements are equival- 
ent: 

(1) tw(G,d) <k-l, 

(2) mon-cw(G, d) < A;, and 

(3) cw(G,d) <k. 

In particular, any first order formula (p satisfies fotw(c^) + 1 = mon-cw(93) = cw(<p). 

Proof. 1 => 2: suppose tw(G, d) < A; — 1. Let (T,B) be a d-stratified tree decomposition 
of width at most k — 1 for C From (T,B), the fe cops can read off a monotone winning 
strategy in the usual way, first moving to B r and then following the robber down the tree 
decomposition into the unique direction where the robber space is covered (see e.g. |3j). 
Since (T, B) is d-stratified, the winning strategy is also d-stratified. 

2 => 3: any monotone winning strategy is a winning strategy. 

3 => 1: suppose k cops have a winning strategy for the d-stratified game on G. 

Claim 1. Let t,u be nodes of (G, d)'s component tree, where u is a child of t. Let x,y 6 
-Dt n D u . If, while playing against the A; cops, the robber can move to x and no cop will 
land on y, then the robber can also move to y. 

Let i be the depth oft in (G, d)'s component tree and let C be the connected component 
of G[C t \ D t ] with C U = CU N(C). As x, y € A we have x,y C, and hence i,t/£ N G (C) 
and there is a path from rr to y with all internal vertices in C. Hence d{z) > i > d(x) and 
d(z) > i > d{y) for all internal vertices of the path. Therefore, as long as the cops play on 
vertices with d at most i, the path is free and the robber can use it. But the cops can never 
move to a vertex with d > i before they have cleared Dt n D u completely, so the path from 
x to y is free whenever the robber can move to x. ■ 

Recall, from Section [U that the graph G^ ' is obtained from G by adding all edges 
between any pair of distinct vertices x,y € Dt PI D u for all directed edges (t, u) of the 
component tree. By the claim, any d-stratified winning strategy for k cops on G is also a d- 
stratified winning strategy on G^ ' . Forgetting d, obviously, k cops have a winning strategy 
on G'°) and hence in particular on G^°'[Dt] for all nodes t of (G, d)'s component tree. Thus 
cw(G^°'[Dt\) < k and by Theorem 16.71 this implies tw(G^°'[Dt\) < k — 1. From the tree 
decompositions of the G^ ' [Dt] of width < k we can now construct tree decompositions for 
the CA ) [Ct] of width < /c — 1 in a bottom-up manner as in the proof of Theorem 14.71 For 
the root r of (G, d)'s component tree, we have G'°) = G^[C r ] and it is easy to see that the 
tree decomposition for G^ ' [C r ] of width < k obtained in this way is d-stratified. □ 

7. Conclusion 

We introduced a notion of tree-width for first order formulae (p, fotw (<£>), generalising tree- 
width of conjunctive queries and elimination-width of quantified constraint formulae [9|. 
Our notion can also be seen as an adjustment of the notion of tree- width of first order 
formulae as defined in |16| (which only works for conjunctive queries with negation). 

We proved that computing fotw is fixed-parameter tractable with parameter fotw (The- 
orem (32|). Moreover, we showed that evaluating formulae of fc-bounded first order tree- 
width is fixed-parameter tractable, with parameter the length of the formula (Theorem 
I5.5p . This is done by first computing a tree decomposition of width at most k for the 
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formula, and then translating the formula equivalently into a formula of the k- variable frag- 
ment C of first order logic. It is well-known that evaluating C formulae can be done in 
polynomial time. When translating the formula </? into an equivalent C k formula, we get a 
non-elementary explosion in the running time. 

Conjecture 7.1. When translating a formula (p satisfying fotw((/j) < k into an equivalent 
£k f ormu i a5 a non-elementary explosion cannot be avoided. 

Moreover, it is still unknown whether the explosion can be avoided in parameterized 
algorithms for evaluating queries of bounded first order tree-width. 

We show that first order tree-width can be characterised by other notions such as 
elimination-width (Theorem 13. X[) . and the minimum number of cops necessary to catch the 
robber in the stratified cops and robbers game, as well as the minimum number of cops 
necessary in the monotone version of the game (Theorem I6.8[) . Hence our notion is very 
natural and robust. 

Moreover, we showed that fotw is more powerful than the notion of elimination-width 
of quantified constraint formulae as defined in [9]: for quantified constraint formulae, both 
bounded elimination-width and bounded fotw allow for model checking in polynomial time. 
We proved that if y is a quantified constraint formula, then fotw((/?) is bounded by the 
elimination-width of tp, and there are classes of quantified constraint formulae with bounded 
fotw and unbounded elimination- width. 

Finally, we showed that fotw is more powerful than tree- width of non-recursive stratified 
datalog (nrsd) programs [T6]. NRSD programs have the same expressive power as first order 
logic, in the sense that nrsd programs correspond to first order formulae and vice versa. 
We showed that first-order tree-width of (formula versions of) NRSD programs is bounded 
by the strict tree-width of the programs and that there are classes of first order formulae 
with bounded fotw, whose corresponding NRSD programs have unbounded strict tree-width. 

For conjunctive query evaluation, methods more powerful than bounded tree-width are 
known. Conjunctive queries of bounded hypertree- width [20], bounded fractional hypertree- 
width [23] and bounded (hyper)closure tree-width [4] yield even larger tractable classes of 
instances. For example, conjunctive queries of bounded hypertree-width correspond to the 
fc-guarded fragment of first order logic [21] , and similar correspondences can be found for the 
other invariants. Why not generalise these notions to first order formulae? By generalising 
these notions to first order formulae (p in the obvious way, a decomposition of bounded 
width would not give us an instruction how to translate (p into the corresponding guarded 
fragment of first order logic (transforming subformulae of ip into conjunctive normal form 
as in the proof of Theorem 15.51 1 does not necessarily yield guarded subformulae) . 

Nevertheless, generalising these notions to quantified constraint formulae should indeed 
yield classes with an efficient query evaluation, that are strictly larger than classes of quan- 
tified constraint formulae of bounded first order tree-width. It would be interesting to find 
the largest fragment of first order formulae for which such a generalization is possible. 
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